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SPEECH  COMMUNICATION  INDEX  METER: 
FURTHER  APPLICATIONS  AND  IMPROVEMENTS 


Abstract 

To  study  the  feasibility  of  using  the  Speech  Communication 
Index  Meter  (SCIM)  to  evaluate  time-varying  communication  systems, 
recordings  were  made  of  the  transmission  characteristics  of  a 
troposphere-scatter  system.  At  many  specific  points  in  time  in 
these  recordings,  Articulation  Indexes  were  calculated  and  intelli¬ 
gibility  scores  were  obtained  from  listeners  with  the  aid  of  a 
special  test  procedure.  For  most  points,  the  intelligibility  score 
could  be  reasonably  well  predicted  from  the  Articulation  Index. 

This  finding  was  interpreted  as  indicating  that  SCIM  is  potentially 
capable  of  evaluating  time-varying  systems. 

In  a  second  study,  SCIM  was  used  to  predict  the  intelligibility 
of  peak-clipped  speech.  For  a  wide  range  of  signal-to-noise  ratios 
and  clipping  levels,  the  performance  of  SCIM  was  compared  with  in¬ 
telligibility  test  results  reported  in  the  literature.  After  a  minor 
modification,  SCIM  computed  reliable  SCI's  which  could  be  related 
to  the  published  intelligibility  data.  This  study  demonstrated  that 
SCIM  could  accurately  evaluate  communication  systems  employing  peak 
clipping. 

SCIM  was  also  modified  to  reduce  the  computation  time,  and  to 
improve  the  synchronization  between  the  signal  generator  and  the 
analyzer.  In  addition,  various  other  circuit  changes  were  imple¬ 
mented  to  improve  the  performance  of  SCIM. 
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1.  EVALUATION  OP  TIME- VARYING  COMMUNICATION  SYSTEMS 


1 . 1  Introduction 

The  transmission  characteristics  and  level  of  performance 
of  many  types  of  speech-communication  systems  vary  with  time. 

In  high-frequency  radio  systems,  for  example,  multipath  interfer¬ 
ence  is  a  common  occurrence.  Disturbances  in  the  ionosphere  may 
change  the  phase  relations  among  radio  signals  that  reach  the 
receiver  over  different  paths,  and  these  changes  produce  rapid 
fading  of  the  audio  signal  and  fluctuations  in  the  background 
noise.  Fading  is  also  encountered  with  mobile  radio  equipment, 
where  the  path  between  transmitter  and  receiver  is  constantly 
changing.  Another  type  of  time-varying  communication  system  is 
the  troposphere-scatter  system.  The  tropo  system  provides  radio 
communication  over  moderate  distances  that  lie  between  the  capabil¬ 
ities  of  ultrahigh-frequency  short-range  systems  and  low-frequency 
long-range  systems. ^  The  transmission  characteristics  of  tropo 
systems  have  been  studied  in  detail  at  the  National  Bureau  of 
Standards . ^ 

The  performance  of  time-varying  speech-communication  systems 
is  usually  specified  in  terms  of  the  length  of  time  that  a  particu¬ 
lar  signal-to-noise  (S/N)  ratio  is  available.  This  S/N  ratio  is 
determined  from  a  short-term  analysis  of  the  received  signal,  and 
the  time  availability  is  determined  from  a  long-term  analysis. 
Because  S/N  ratio  is  a  measure  which  is  related  to  the  intelligi¬ 
bility  of  the  received  speech,  it  is  a  valid  indicator  of  system 
performance.  However,  wide-band  S/N  ratio  is  insensitive  to 
spectral  changes  in  the  received  noise  which  may  also  affect  intell¬ 
igibility  . 
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A  more  accurate  indicator  of  performance  can  be  obtained 
by  measuring  the  S/N  ratio  in  each  of  several  frequency  bands 
which  are  equally  important  to  speech  intelligibility.  Such 
multi-band  measurements  are  required  for  the  calculation  of  the 

g 

Articulation  Index  (AI),  from  which  the  intelligibility  of 
communication  systems  with  non-varying  characteristics  may  be 
estimated.  Since  the  design  of  the  Speech  Communication  Index 

Q 

Meter  (SCIM)  is  based  upon  the  concept  of  AI,  it  would  appear 
that  SCIM  is  potentially  capable  of  evaluating  time-varying 
systems  more  accurately  than  is  possible  with  presently  available 
methods . 

Before  SCIM  can  be  employed  in  this  manner,  the  relation 
between  the  Speech  Communication  Index  (SCI)  and  speech  intell¬ 
igibility  must  be  established  for  time-varying  systems.  The 
computed  SCI's  must  be  somehow  compared  with  how  well  speakers 
and  listeners  can  communicate  over  the  system.  Such  a  compar¬ 
ison  is  complicated  by  two  factors:  (1)  The  SCI  depends  strongly 
on  the  time  interval  over  which  the  "speech"  and  noise  are 
integrated,  and  (2)  no  standard  test  procedures  are  available  for 
measuring  speech  intelligibility.  In  this  study,  a  test  proce¬ 
dure  was  developed  and  used  to  investigate  how  the  characteristics 
of  a  time-varying  system  should  be  measured  by  a  device  like  SCIM. 

1 . 2  Approach 


The  usual  procedure  for  measuring  speech  intelligibility 
is  as  follows:  One  or  more  trained  speakers  read  several  50-item 
word  lists  to  a  group  of  listeners  over  the  communication  system 
to  be  evaluated.  Depending  on  the  type  of  test,  the  listeners 
are  required  either  to  write  down  each  test  word  heard  or  to  re¬ 
spond  to  each  item  by  selecting  the  test  word  from  a  number  of 
given  alternatives.  The  percentage  of  words  that  are  correctly 
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identified  constitutes  the  test  score.  Because  some  words  in  a 
given  list  are  inherently  more  vulnerable  to  distortion  or  inter 
ference  (and  therefore  more  difficult  to  transmit  over  a  system) 
than  other  words,  the  word  lists  are  fairly  well  balanced  with 
respect  to  the  distribution  of  particular  phonemes  and  overall 
difficulty.  The  system  parameters  are  assumed  to  be  constant 
during  the  presentation  of  the  word  lists. 

If  a  time-varying  system  were  evaluated  in  the  manner  just 
described,  the  intelligibility  of  a  given  test  word  would  depend 
not  only  on  the  relative  difficulty  of  the  word  but  also  on  the 
system  characteristics  at  the  time  the  word  is  transmitted.  The 
overall  test  score  obtained  would  depend  on  how  the  system 
characteristics  happen  to  be  synchronized  with  the  order  of  the 
test  items,  and  would  therefore  be  difficult  to  interpret. 

To  resolve  this  problem,  a  new  test  procedure  was  developed 
The  transmission  characteristics  of  a  typical  tropo  system  were 
recorded  on  magnetic  tape.  Three  3-minute  Program  Tapes  were 
prepared  from  this  recording.  Twenty-five  3-minute  Speech  Tapes 
each  containing  a  sequence  of  60  test  words,  were  also  prepared. 
The  words  on  the  Speech  Tapes  were  spaced  exactly  3  seconds 
apart.  A  given  Program  Tape  was  reproduced  25  times,  each  time 
together  with  a  different  Speech  Tape,  to  provide  25  words  for 
the  calculation  of  each  of  60  intelligibility  scores.  Thus, 
every  3  seconds  an  intelligibility  score  was  obtained  which  was 
based  on  listener  responses  to  25  words.  Even  lists  of  only  25 
words,  often  called  half  lists,  offer  some  degree  of  balance 
with  respect  to  word  difficulty. 
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Each  Program  Tape  was  also  reproduced  through  1/3-octave  band 
filters  to  provide  spectral  information  from  which  AI's  could  be 
computed  at  the  same  60  points  in  time  where  intelligibility  scores 
were  obtained.  These  AI's,  which  represent  scores  that  might  be 
computed  by  a  device  like  SCIM,  were  then  converted  to  equivalent 
intelligibility  scores.  These  latter  scores  were  compared  with  the 
scores  obtained  from  listeners  to  determine  how  closely  the  intell¬ 
igibility  of  the  tropo  system  could  be  predicted  from  AI’s. 

Various  arguments  can  be  advanced  as  to  why  the  predicted 

scores  might  differ  from  the  obtained  scores.  One  argument  is 

based  on  the  fact  that  a  test  word  "samples"  the  characteristics 

of  the  tropo  system  for  the  entire  duration  of  the  word,  whereas  the 

AI  is  computed  at  one  particular  point  in  time.  Results  which 

support  this  argument  will  probably  also  provide  useful  information 

regarding  the  optimum  duration  of  SCIM  measurements.  Another  argu- 
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ment  involves  a  phenomenon  known  as  temporal  masking.  *  ’ 

Temporal  masking  has  been  demonstrated  in  experiments  such  as  those 
illustrated  in  Pig.  1-1. 

A  probe  tone  occurring  shortly  before  or  after  an  intense 
burst  of  white  noise  will  be  masked  by  the  noise  burst  even  though 
the  two  signals  are  presented  sequentially  to  the  listener.  The 
level  of  masking  depends  primarily  on  the  intensity  of  the  noise 
burst  and  on  the  interval  between  the  two  signals.  Temporal 
masking  appears  to  be  negligible  for  intervals  greater  than  100 
milliseconds . 
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FIG. 1-1  TEMPORAL  MASKING  OF  PROBE  TONE  OCCURRING 
BEFORE  OR  AFTER  BURST  OF  WHITE  NOISE.  DATA 
FROM  ELLIOTT  1962  (• — •)  AND  PICKETT  1959  (o— o). 


How  temporal  masking  could  Influence  the  results  of  the 
present  study  is  outlined  in  the  hypothetical  situation  shown  in 
Fig.  1-2.  A  number  of  test  words,  assumed  to  be  equally  difficult, 
are  transmitted  over  a  time-varying  system  at  approximately  the 
same  level.  These  words  are  received  at  different  levels,  to¬ 
gether  with  widely  fluctuating  background  noise.  At  the  location 
of  each  test  word,  the  speech  and  noise  spectra  are  determined 
and  the  AI  is  calculated.  The  AI  values  are  converted  to  pre¬ 
dicted  test  scores.  Test  scores  are  also  obtained  from  listeners. 

Suppose  that  Test  Words  A,  B,  and  C  are  received  at  the  same 
level.  Because  Words  B  and  C  occur  during  an  interval  character¬ 
ized  by  a  low  noise  level,  their  AI  values  and  predicted  scores 
are  much  higher  than  the  AI  value  and  predicted  score  for  Word  A. 
The  obtained  scores  for  these  three  words  may,  however,  be  very 
similar.  Word  B,  occurring  immediately  after  the  cessation  of  a 
high  noise  level,  may  be  subjected  to  forward  masking.  Similarly, 
Word  C  may  be  subjected  to  backward  masking. 

Considering  the  critical  conditions  that  must  exist  for 
temporal  masking  to  occur,  the  effect  is  not  likely  to  be  as  pro¬ 
nounced  as  is  illustrated  in  Fig.  1-2.  Nevertheless,  if  some 
results  can  be  interpreted  in  terms  of  this  argument,  then  a 
correction  for  temporal  masking  should  be  included  in  the  design 
of  SCIM. 


1 . 3  Preparation  of  Program  Tapes 

A  block  diagram  of  the  instrumentation  used  to  record  the 
Program  Tapes  is  shown  in  Fig.  l-3(a).  An  experimental  Air  Force 
tropo  system  was  used,  which  consisted  of  a  transmitter  and  a 
receiver  separated  by  200  miles.  Although  the  system  could 
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FIG. 1-2  HYPOTHETICAL  CHARACTERISTICS  OF 
TIME-VARYING  COMMUNICATIONS  SYSTEM. 
INTELLIGIBILITY  OF  TEST  WORDS  B  AND  C  IS 
LOWER  THAN  PREDICTED  FROM  ARTICULATION 
INDEX  BECAUSE  OF  TEMPORAL  MASKING. 
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FIG.  1-3  BLOCK  DIAGRAMS  OF  INSTRUMENTATION  USED  TO 
(a)  RECORD  PROGRAM  TAPE,  (b)  PREPARE  COMBINATION 
TAPE,  AND  (c)  REPRODUCE  COMBINATION  TAPE. 


provide  several  modes  of  diversity  operation,  these  were  not 
employed.  A  constant-level  1  KHz  tone  was  applied  to  one  of 
the  voice  channels  at  the  transmitter  to  represent  the  speech 
signal  ordinarily  transmitted  over  the  system.  The  output  of 
the  receiver  was  processed  by  two  filters.  One  filter  was  a 
sharp-skirted  bandpass  filter  centered  on  the  tone,  and  the 
other  filter  was  a  complementary  sharp-skirted  band-elimination 
filter.  The  output  of  the  bandpass  filter  served  to  describe 
how  the  level  of  a  received  speech  signal  might  vary  as  a 
function  of  time.  The  output  of  the  band-elimination  filter 
described  how  the  level  and  spectrum  of  the  received  noise 
varied.  The  two  filter  outputs  were  recorded  simultaneously 
with  a  two-channel  tape  recorder. 

The  instrumentation  just  described  is  based  on  the  assump¬ 
tion  that  the  spectrum  of  a  received  speech  signal  remains  con¬ 
stant  when  the  level  of  the  signal  changes.  The  validity  of 
this  assumption  was  determined  in  an  earlier  experiment  with 
another  tropo  system.  Nearly  all  tropo  systems  employ  frequency 
modulation  and  transmit  a  wide-band  channel  from  which  a  number 
of  voice  channels  are  derived  with  multiplex  equipment.  A 
constant-level  1  KHz  tone  was  applied  simultaneously  to  several 
voice  channels  at  the  transmitter,  and  the  levels  of  the  received 
tones  were  recorded  with  a  multi-channel  chart  recorder.  It  was 
noted  that  the  recorded  levels  tended  to  vary  in  unison  during 
intervals  of  fading.  Since  the  tones  correspond  to  different 
frequencies  in  the  wide-band  signal,  it  was  concluded  that  the 
spectrum  of  the  wide-band  signal  is  not  appreciably  altered  when 
level  changes  occur. 
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Various  transmitter  power  settings  were  employed  in  re¬ 
cording  the  output  of  the  tropo  receiver.  After  listening  to 
the  recordings  obtained  and  observing  graphic-level  tracings 
of  the  time-varying  noise,  a  30-minute  recording  made  at  a 
power  setting  of  600  watts  was  selected.  This  recording  was 
reduced  to  three  3-minute  sections  to  furnish  the  three  Program 
Tapes  which  were  used  in  this  study.  The  editin';  was  accomplished 
so  that  different  types  of  interference  patterns  were  included 
and  so  that  the  test  words  (on  the  Speech  Tapes)  would  occur  in 
a  variety  of  positions  with  respect  to  the  interference.  A 
short  burst  of  white  noise  was  inserted  at  the  beginning  of  eacn 
Program  Tape  to  facilitate  accurate  synchronization  with  the 
Speech  Tapes. 


1 . 4  Preparation  of  Speech  Tapes 

It  was  considered  desirable  to  obtain  as  many  intelligibility 
scores  as  possible  for  each  Program  Tape  without  extending  the 
duration  of  the  tape  beyond  3  minutes.  The  duration  of  each 
Program  Tape  determines  the  length  of  a  listening  test,  and 
past  experience  in  intelligibility  testing  has  indicated  that 
listeners  may  become  fatigued  when  a  test  lasts  longer  than  3-5 
minutes.  Furthermore,  it  was  anticipated  that  the  synchroniza¬ 
tion  between  Program  Tapes  and  Speech  Tapes  could  be  maintained 
more  accurately  for  3-minute  tapes  than  for  tapes  of  longer 
duration . 

On  the  basis  of  these  considerations,  the  Modified  Rhyme 
Test  ( MRT ) u  was  selected.  Since  MRT  words  can  be  readily  pre¬ 
sented  and  marked  by  listeners  at  the  rate  of  one  word  every 
3  seconds,  60  intelligibility  scores  could  be  obtained  for  each 
3-minute  Program  Tape.  The  speech  material  of  the  MRT  consists 
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of  six  50-word  lists  which  are  approximately  equally  difficult. 
Each  list  is  composed  of  25  words  in  which  the  initial  conson¬ 
ant  is  tested  and  25  words  in  which  the  final  consonant  is 
tested . 

For  the  present  application,  a  rearrangement  of  this 
material  appeared  to  be  necessary.  If  the  standard  50-word 
lists  were  to  be  used,  50  Speech  Tapes  would  be  required.  The 
effort  of  preparing  and  administrating  this  many  Speech  Tapes 
seemed  unwarranted.  Also,  considering  that  60  intelligibility 
scores  were  to  be  obtained  for  each  Program  Tape,  it  seemed 
desirable  to  use  more  than  six  basic  lists.  It  was  therefore 
decided  to  construct  twelve  25-word  lists.  In  an  effort  to 
construct  lists  of  equal  difficulty,  two  sets  of  data  obtained 
with  the  MRT  were  examined. 

Data  gathered  during  the  development  and  evaluation  of  the 
MRT^  consisted  of  scores  for  each  of  the  50  words  in  each  list, 
averaged  over  2  speakers,  18  listeners,  and  6  S/N  ratios.  The 
25  ensembles  with  initial  variable  consonants  were  divided  into 
two  groups  (of  12  and  13  ensembles)  that  would  have  the  same 
mean  score.  The  25  ensembles  with  final  variable  consonants 
were  similarly  divided  into  two  groups.  Mean  scores  were  then 
computed  for  the  resulting  12-  and  13-item  lists.  In  order  to 
achieve  similar  scores  across  the  12  related  lists  (variable 
consonant  in  same  position),  some  words  were  permuted  within 
their  respective  ensembles.  Twelve  25-item  lists  with  similar 
mean  scores  were  thus  obtained. 
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Other  available  data  consisted  of  scores  for  each  word 
in  each  list,  averaged  over  1  speaker,  13  listeners,  and  2 
S/N  ratios.  Using  these  data,  the  twelve  25-item  lists  were 
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again  examined  for  equal  difficulty.  The  original  grouping  of 
ensembles  was  changed  slightly,  and  more  words  were  permuted 
within  ensembles,  to  achieve  similar  scores  across  the  12  related 
lists.  At  the  same  time,  care  was  taken  to  maintain  similar 
scores  across  related  lists  for  the  first  set  of  data.  While  an 
attempt  was  made  to  balance  the  lists  with  respect  to  the 
occurrence  of  particular  phonemes,  the  requirement  for  equal 
difficulty  was  given  priority. 

The  revised  25-item  lists,  referred  to  as  Half  Lists  A-L, 

are  given  in  Appendix  1.  The  relative  difficulty  of  these 

22 

lists  was  determined  experimentally  in  another  study.  The 
scores  that  were  obtained  for  each  list  at  each  of  5  S/N  ratios 
are  given  in  Appendix  2.  Although  some  lists  appeared  to  be 
slightly  more  difficult  than  others,  the  balance  that  was 
achieved  was  considered  adequate. 

Five  randomizations  of  the  12  half  lists  were  prepared  to 
provide  a  half  list  at  each  of  60  points  in  time.  The  first 
word  in  each  of  the  60  randomized  half  lists  was  identified  as 
a  word  to  be  included  in  Speech  Tape  1.  In  the  same  manner, 
the  remaining  words  in  each  half  list  were  marked  for  inclusion 
in  Speech  Tapes  2-25-  Each  Speech  Tape  would  thus  consist  of 
60  words,  but  these  words  do  not  constitute  any  particular  half 
list.  The  relation  between  the  contents  of  a  given  Program 
Tape  and  the  test  words  of  the  25  Speech  Tapes  is  illustrated 
in  Fig.  1-4. 

The  25  Speech  Tapes  were  then  recorded  by  an  adult  male 
speaker  with  considerable  experience  in  reading  test  materials. 
The  recordings  were  made  with  high-quality  equipment  in  a 
sound-treated  studio.  A  flashing  light  signal  was  used  during 
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FIG. 1-4  DIAGRAM  SHOWING  HOW  A  PROGRAM  TAPE  WAS  COMBINED 
WITH  25  SPEECH  TAPES  TO  PROVIDE  A  25-WORD  HALF  LIST 
AT  EACH  OF  60  POINTS  IN  TIME  OF  PROGRAM  TAPE. 


the  recording  to  help  the  speaker  maintain  the  required 
3-second  interval  between  words.  This  light  signal  was  con¬ 
trolled  from  a  calibrated  pulse  generator.  Although  the  words 
were  not  embedded  in  a  carrier  phrase  and  no  item  numbers  were 
recorded,  the  speaker  tried  to  read  all  words  for  a  given 
Speech  Tape  with  the  same  vocal  effort.  When  the  speaker  made 
a  mistake,  the  entire  Speech  Tape  in  which  the  mistake  occurred 
was  recorded  again. 

To  check  the  accuracy  of  the  timing  of  the  words,  graphic- 
level  tracings  were  made  of  each  Speech  Tape.  These  tracings 
were  aligned  with  a  time  scale  that  indicated  the  exact  positions 
where  the  words  should  occur.  Corrections  were  calculated  for 
those  words  which  occurred  more  than  100  milliseconds  too  early 
or  too  late,  and  these  corrections  were  then  used  to  edit  the 
Speech  Tapes.  A  short  burst  of  white  noise  was  inserted  at  the 
beginning  of  each  Speech  Tape  to  facilitate  accurate  synchroniza¬ 
tion  with  the  Program  Tapes. 


1 . 5  Preparation  and  Administration  of  Combination  Tapes 

To  provide  synchronism  between  a  Program  Tape  and  a  Speech 
Tape,  two  tape  recorders  were  coupled  electrically  so  that  the 
tapes  could  be  started  simultaneously.  Visual  markers  on  both 
sets  of  tapes,  positioned  just  before  the  noise  bursts,  were 
used  to  accomplish  the  initial  alignment.  During  the  subsequent 
reproduction  of  the  two  tapes,  a  progressive  deterioration  of 
synchronization  was  observed  in  some  cases.  This  problem  could 
be  related  to  a  dimensional  instability  of  the  tape  material 
(acetate  base),  and  was  resolved  by  using  a  special  variable- 
frequency  power  source  to  operate  the  capstan  motor  of  one  of 


the  tape  recorders.  Because  of  this  requirement ,  it  was  con¬ 
sidered  necessary  to  combine  the  Program  Tapes  and  Speech  Tapes 
prior  to  their  presentation  to  listeners. 

A  block  diagram  of  the  instrumentation  used  to  prepare  the 
Combination  Tapes  is  shown  in  Fig.  l-3(b).  The  1  KHz  tone 
reproduced  from  the  Program  Tape  was  rectified  and  low-pass 
filtered,  and  the  resulting  d-c  signal  was  applied  to  a  modula¬ 
tor.  This  modulator  was  used  to  vary  the  level  of  the  speech 
signal  reproduced  from  the  Speech  Tape.  The  output  of  the 
modulator  and  the  time-varying  noise  reproduced  from  the  Program 
Tape  were  recorded  with  a  two-channel  tape  recorder.  This 
arrangement  allowed  independent  settings  of  the  speech  and  noise 
levels  during  the  administration  of  the  Combination  Tapes. 

Since  3  Program  Tapes  had  to  be  combined  with  25  Speech 
Tapes,  a  total  of  75  Combination  Tapes  were  prepared.  These 
Combination  Tapes  were  ordered  according  to  the  test  design  given 
in  Appendix  3  and  administered  to  a  group  of  listeners  during 
five  test  sessions.  The  listeners  were  provided  with  response 
forms  of  the  type  shown  in  Appendix  4.  A  block  diagram  of  the 
instrumentation  used  for  administering  the  Combination  Tapes  is 
shown  in  Fig.  1-3 (c).  The  speech  attenuator  was  adjusted  so  that 
the  maximum  sound  pressure  level  of  the  speech  (corresponding  to 
the  highest  level  of  the  1  KHz  tone  on  the  Program  Tapes)  was 
71  dB  re  0.0002  microbar  under  the  earphones.  Similarly,  the 
noise  attenuator  was  adjusted  so  that  the  maximum  sound  pressure 
level  of  the  noise  was  78  dB  re  0.0002  microbar.  The  results 
obtained  from  10  listeners  are  tabulated  in  Appendix  5. 

For  each  Program  Tape  and  Speech  Tape,  Appendix  5  shows 
the  number  of  errors  that  were  made  by  10  listeners  at  each 
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point  in  time  (referred  to  as  a  word  location).  Also  shown  for 
each  Program  Tape  are  the  mean  and  standard  deviation  of  the 
errors  made  at  each  word  location  on  all  Speech  Tapes.  The 
means  are  also  expressed  as  intelligibility  scores  in  percent, 
corrected  for  chance.*  Large  standard  deviations  (over  3-20) 
at  particular  word  locations  suggest  that  rapid  changes  in  the 
level  of  the  noise  occurring  at  those  times  may  have  disturbed 
the  normal  balance  in  difficulty  between  words  with  initial 
consonants  and  words  with  final  consonants  under  test.  If,  for 
example,  the  noise  level  increased  rapidly  during  the  presenta¬ 
tion  of  the  words  constituting  a  half  list,  the  initial  conson¬ 
ants  would  be  masked  less  than  the  final  consonants.  As  a 
result,  words  with  initial  consonants  under  test  would  be  more 
intelligible  than  words  with  final  consonants  under  test.  The 
intelligibility  scores  shown  for  such  word  locations  are  there¬ 
fore  less  reliable. 


1.6  Calculation  of  Articulation  Index 


In  order  to  calculate  an  AI  at  each  word  location  in  each 
Program  Tape,  it  was  first  necessary  to  subject  the  Program 
Tapes  to  a  detailed  spectral  analysis.  The  instrumentation  used 
to  obtain  the  required  spectral  data  is  represented  in  Fig.  1-5. 

A  graphic-level  tracing  was  prepared  of  the  1  KHz  tone  to  pro¬ 
vide  information  regarding  the  speech  level  at  each  word  location. 


*  All  obtained  intelligibility  scores  were  corrected  for  chance 
with  the  formula: 

Corrected  Score  =  Items  Correct  -  (Items  Incorrect )/5 
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This  tracing  was  calibrated  in  terms  of  the  maximum  level  at 
which  the  speech  had  been  presented  to  the  listeners.  The 
time-varying  noise  was  reproduced  sequentially  through  fifteen 
1/3-octave  band  filters.  The  center  frequencies  of  these 
filters  ranged  from  200  Hz  to  5  KHz.  A  graphic-level  tracing 
was  prepared  from  the  output  of  each  filter;  these  tracings 
were  calibrated  in  terms  of  the  maximum  overall  level  at  which 
the  noise  had  been  presented. 

By  means  of  a  time  scale  common  to  all  tracings,  the  speech 
level  and  the  noise  spectrum  at  each  word  location  could  now  be 
uniquely  determined.  The  tracings  were  read  at  each  word  loca¬ 
tion,  and  from  this  information  the  AI's  were  calculated.  An 
example  of  such  a  calculation  is  given  in  Fig.  1-6.  For  this 
particular  word  location  (1:52),  the  tracing  of  the  1  KHz  tone 
was  read  as  69  dB.  This  is  5  dB  below  the  overall  level  which 
corresponds  to  a  theoretical  reference  speech  spectrum.*  The 
"measured"  speech  spectrum  is  therefore  shown  5  dB  below  the 
reference  speech  spectrum.  The  measured  noise  spectrum  is 
determined  directly  from  the  15  graphic-level  tracings  of  the 
noise.  For  this  example,  the  AI  was  calculated  as  shown  in 
Appendix  6. 

The  AI's  calculated  in  this  manner  for  all  word  locations 
in  each  of  the  three  Program  Tapes  are  tabulated  in  Appendix  7 . 
The  calculations  were  carried  out  with  the  aid  of  a  digital 
computer.  Using  the  relation  between  AI  and  MRT  scores  shown 


*  As  required  by  the  procedure  for  calculating  AI,  the  reference 
speech  spectrum  includes  a  12  dB  correction  for  speech  peaks. 
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1/3'  OCTAVE  BAND  LEVEL  IN  dB  (RE  0.0002  MICROBAR  ) 
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in  Fig.  1-7  (solid  curve),  the  calculated  AI ' s  were  then  con¬ 
verted  to  equivalent  MRT  scores.  The  data  shown  in  Fig.  1-7 
were  obtained  in  a  related  study22  which  deserves  some  comment. 

The  related  study  was  concerned  with  the  masking  of  speech 
by  aircraft  noise.  Recordings  of  various  aircraft  flyovers 
were  assembled  into  three  3-minute  Program  Tapes  which  were 
similar  to  the  Program  Tapes  prepared  in  the  present  study. 

Both  studies  employed  the  same  25  Speech  Tapes  and  the  identical 
test  procedure  for  obtaining  intelligibility  scores.  In  the 
aircraft  study,  AI ' s  were  also  computed  at  the  60  word  locations 
of  each  Program  Tape.  One  important  difference  between  the  two 
studies  was  the  rate  at  which  the  noise  varied.  Whereas  the 
overall  noise  level  in  the  present  study  could  change  by  as  much 
as  25  dB  within  an  interval  of  4 00  milliseconds  (average  duration 
of  test  words),  the  overall  level  of  the  aircraft  noise  never 
changed  by  more  than  5  dB  during  the  same  interval.  Another 
difference  was  the  fact  that  in  the  aircraft  study  the  Speech 
Tapes  were  presented  to  listeners  at  a  uniform  level.  Consider¬ 
ing  these  differences,  it  was  assumed  that  a  more  reliable 
relation  between  AI  and  MRT  scores  could  be  obtained  from  the 
aircraft  study  than  from  the  present  study. 

As  indicated  in  Fig.  1-7,  the  aircraft  study  provided  two 
relations;  one  was  obtained  with  time-varying  noise  (aircraft 
flyovers),  and  the  other  with  steady-state  noise.  For  a  given 
AI,  the  time-varying  noise  masked  the  test  words  to  a  lesser 
extent  than  the  steady-state  noise.  This  difference  in  masking 
may  be  attributable  to  such  factors  as  listener  attention, 
adaptation,  and  fatigue.  The  relation  obtained  with  time-varying 
noise  was  considered  more  appropriate  for  the  present  study  than 
the  other  relation.  The  use  of  this  relation,  however,  excluded 
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FIG. 1-7  RELATION  BETWEEN  ARTICULATION  INDEX 
AND  HALF-LIST  MODIFIED  RHYME  TEST  SCORE 
(CORRECTED  FOR  CHANCE  )  FOR  TIME-VARYING 
AND  STEADY-STATE  AIRCRAFT  NOISE.. 
DATA  FROM  WILLIAMS  ET  AL.  1967. 


an  investigation  of  the  above  mentioned  factors  which  might  be 
relevant  to  the  design  of  SCIM. 


1 . 7  Comparison  Between  Obtained  and  Predicted  Scores 

For  each  word  location  of  each  Program  Tape,  the  intell¬ 
igibility  score  obtained  from  the  listeners  was  compared  with 
the  score  predicted  from  the  AI  calculation.  Plots  of  the 
relation  between  these  obtained  and  predicted  scores  are  shown 
for  Program  Tapes  I,  II,  and  III  in  Figs.  1-8,  1-9,  and  1-10, 
respectively.  Each  point  in  these  plots  represents  a  word 
location;  the  obtained  scores  for  word  locations  denoted  by 
solid  points  are  considered  more  reliable  than  the  obtained 
scores  for  word  locations  denoted  by  open  points.  (See  Appendix  5-) 
Word  locations  where  the  obtained  scores  were  predicted  correctly 
can  be  identified  by  points  that  lie  on  an  imaginary  diagonal 
line.  The  horizontal  displacement  of  points  from  such  a  line 
is  a  measure  of  how  accurately  the  obtained  scores  were  pre¬ 
dicted  . 

To  facilitate  an  analysis  of  the  incorrect  predictions,  it 
was  necessary  to  define  a  region,  centered  about  the  diagonal 
line,  over  which  the  obtained  scores  could  be  considered 
correctly  predicted.  This  region,  shown  in  Figs.  1-8,  1-9,  and 
1-10  by  two  dashed  lines,  was  adopted  from  the  aircraft  study. 

When  the  data  obtained  in  the  aircraft  study  were  arranged  in 
the  same  manner  as  the  present  data,  the  chosen  region  enclosed 
156  of  the  180  word  locations  (87  percent  of  all  data). 

Assuming  that  the  distribution  of  the  aircraft  data  was  only 
moderately  skewed,  the  region  corresponded  to  approximately 
1.5  standard  deviations,  measured  in  one  direction  from  the  mean. 
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OBTAINED  HALF-LIST  MRT  SCORE  IN  PERCENT 


FIG. 1-8  PLOT  OF  PREDICTED  VERSUS  OBTAINED 

HALF-LIST  MRT  SCORES  (CORRECTED  FOR  CHANCE) 
FOR  60  POINTS  IN  TIME  OF  PROGRAM  TAPE  I. 
DIAGONAL  LINES  ARE  EXPLAINED  IN  TEXT. 


OBTAINED  HALF-LIST  MRT  SCORE  IN  PERCENT 


FIG. 1-9  PLOT  OF  PREDICTED  VERSUS  OBTAINED 

HALF-LIST  MRT  SCORES  (  CORRECTED  FOR  CHANCE) 
FOR  60  POINTS  IN  TIME  OF  PROGRAM  TAPE  H. 
DIAGONAL  LINES  ARE  EXPLAINED  IN  TEXT. 
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FIG. 1-10  PLOT  OF  PREDICTED  VERSUS  OBTAINED 
HALF-LIST  MRT  SCORES  ( CORRECTED  FOR  CHANCE) 
FOR  60  POINTS  IN  TIME  OF  P  R  OGR  AM  T  A  PE  m  . 
DIAGONAL  LINES  ARE  EXPLAINED  IN  TEXT. 


In  an  effort  to  determine  why  the  obtained  scores  were 
predicted  incorrectly  at  certain  word  locations,  the  graphic- 
level  tracings  of  the  Program  Tapes  were  carefully  examined. 

This  examination  revealed  that  at  the  word  locations  of  interest 
either  the  speech  level,  the  noise  level,  or  both  levels  varied 
considerably.  As  a  consequence,  different  portions  of  a  given 
word  were  subjected  to  different  amounts  of  interference.  Con¬ 
sidering  the  nature  of  the  MRT,  those  portions  containing  the 
initial  and  final  consonants  were  probably  more  important  in 
the  determination  of  the  obtained  score  than  any  other  portions. 
The  AI  calculation,  however,  from  which  the  predicted  score  was 
determined,  only  sampled  the  interference  at  one  point  in  time. 
This  point  corresponded  to  the  central  portion  of  the  word  which 
contains  the  vowel. 

All  incorrectly  predicted  intelligibility  scores  could  be 
explained  in  terms  of  this  general  finding:  At  the  boundaries 
of  the  average  word  duration  (approximately  ^00  milliseconds) 
where  the  initial  and  final  consonants  occurred,  the  interference 
conditions  were  appreciably  different  from  the  conditions  that 
prevailed  in  the  center  of  the  duration,  where  the  AI  was 
calculated.  The  word  locations  of  interest  were  ordered  into 
two  groups  according  to  whether  the  predicted  test  score  was 
too  low  or  too  high.  Within  each  group,  the  word  locations 
were  further  divided  according  to  the  probable  cause  of  the 
incorrect  prediction.  These  groupings  are  given  in  Appendix  8. 
Five  arbitrarily  selected  examples  from  each  group  are  also 
illustrated  in  Figs.  1-11  and  1-12. 
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PREDICTED  TEST  SCORE  TOO  LOW 
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FIG. 1-12  SPEECH  AND  OVERALL  NOISE  LEVEL  TRACINGS  FOR  SOME  WORD 
LOCATIONS  WHERE  THE  PREDICTED  TEST  SCORE  WAS  TOO  HIGH. 


Consider  the  level  tracings  for  Word  Location  I:  41,  shown 
in  Fig.  1-11.  The  temporal  course  of  the  speech  level  suggests 
that  the  initial  and  final  consonants  were  barely  attenuated. 

This  was  not  taken  into  account  by  the  AI  calculation.  Because 
the  speech  level  used  in  the  AI  calculation  was  read  too  low  on 
the  tracing,  the  predicted  test  score  was  too  low.  To  determine 
the  amount  by  which  the  speech  level  was  read  too  low,  the  AI 
calculation  was  repeated  with  various  higher  speech  levels  until 
the  result  approximated  the  AI  value  that  corresponded  to  the 
obtained  score  (Fig.  1-7)*  For  this  example,  the  correction 
factor  was  +5  dB,  as  is  indicated  in  the  lower  part  of  Fig.  1-11. 

The  level  tracings  for  Word  Location  111:28  illustrate  the 
case  where  the  noise  level  was  read  too  high.  Although  the 
position  of  the  final  consonant  was  masked  more  than  was  re¬ 
flected  in  the  AI  calculation,  the  position  of  the  initial 
consonant  was  masked  much  less.  As  a  result,  the  predicted  test 
score  was  again  too  low.  The  corrected  AI  calculation  indicated 
that  the  noise  level  was  read  14  dB  too  high.  The  level  tracings 
for  this  and  several  other  word  locations  showed  an  interesting 
trend:  The  speech  and  noise  levels  used  in  the  corrected  AI  calcu¬ 

lations  tended  to  agree  more  with  the  levels  existing  at  the 
position  of  the  initial  consonant  than  with  those  existing  at 
the  position  of  the  final  consonant.  No  satisfactory  explanation 
could  be  found  for  this  trend. 

Now  consider  the  level  tracings  for  Word  Location  1:8, 
shown  in  Fig.  1-12.  The  speech  level  used  in  the  AI  calculation 
did  not  take  into  account  the  extreme  attenuation  at  the  posi¬ 
tion  of  the  initial  consonant  and  was  therefore  read  too  high. 
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It  is  also  possible  that  the  noise  level  was  read  too  low,  since 
the  noise  level  used  in  the  AI  calculation  was  lower  than  the 
noise  level  which  existed  at  the  position  of  the  initial  conson¬ 
ant.  While  both  of  these  factors  undoubtedly  contributed  to 
a  predicted  test  score  that  was  too  high,  it  was  difficult  to 
determine  which  factor  was  the  dominant  one.  From  an  inspection 
of  the  traces,  and  from  the  knowledge  that  a  total  level  change 
of  7  dB  was  required,  it  was  determined  that  the  speech  level 
was  primarily  responsible  for  the  incorrect  prediction. 

Five  word  locations  for  which  the  predicted  score  was 
nearly  the  same  as  the  obtained  score  we re  randomly  selected 
from  the  three  Program  Tapes.  The  level  tracings  for  these  word 
locations  are  shown  in  Fig.  1-13*  The  speech  and  noise  levels 
appear  to  be  generally  more  uniform  over  the  word  durations 
than  the  levels  shown  in  Figs.  1-11  and  1-12.  With  the  possible 
exception  of  the  noise-level  tracing  for  Word  Location  11:38 
and  the  speech-level  tracing  for  Word  Location  I:51>  the  tracings 
do  not  exhibit  any  features  which  might  have  led  to  incorrect 
predictions . 


1 . 8  Conclusions 

Within  the  criterion  of  accuracy  employed  in  this  study, 
approximately  76  percent  of  the  intelligibility  scores  obtained 
from  listeners  could  be  correctly  predicted  from  the  computed 
AI's.  All  incorrect  predictions  could  be  attributed  to  the 
fact  that,  whereas  an  intelligibility  score  was  influenced  by 
the  system  characteristics  for  about  400  milliseconds,  the  AI 
was  essentially  computed  for  a  single  point  in  time.  Contrary 
to  theoretical  expectations,  the  incorrect  predictions  could 
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PREDICTED  TEST  SCORE  CORRECT 


FIG. 1-13  SPEECH  AND  OVERALL  NOISE  LEVEL  TRACINGS  FOR  SOME  WORD 
LOCATIONS  WHERE  THE  PREDICTED  TEST  SCORE  WAS  CORRECT. 


not  be  related  to  temporal  masking.  The  results  of  this  study 
suggest  that  fewer  Incorrect  predictions  would  have  been  made 
if  the  speech  and  noise  level  tracings  had  both  been  averaged 
over  a  400  millisecond  interval  at  each  word  location.  Perhaps 
the  number  of  incorrect  predictions  could  have  been  reduced 
even  further  if,  in  obtaining  average  speech  and  noise  levels 
for  each  AI  calculation,  the  initial  portion  of  the  interval 
had  been  given  more  weight  than  the  final  portion. 

The  computed  AI's  were  intended  to  represent  scores  which 
might  be  obtained  with  a  device  like  SCIM.  Despite  their  general 
validity,  the  AI's  are  clearly  not  representative  in  the  sense 
that  they  are  based  on  instantaneous  measurements.  SCIM  employs 
circuits  which  integrate  the  system  characteristics  over  an 
extended  period  of  time  before  an  SCI  is  computed.  If  SCIM  were 
used  to  process  the  Program  Tapes  prepared  in  this  study,  and  if 
the  integration  time  and  sampling  rate  of  SCIM  were  properly 
adjusted,  then  the  resulting  SCI's  should  accurately  predict  the 
obtained  intelligibility  scores.  It  is  concluded,  therefore, 
that  SCIM  is  potentially  suitable  for  the  automatic  evaluation 
of  time-varying  communication  systems. 

Although  it  is  difficult  to  make  specific  recommendations 
for  the  possible  modification  of  SCIM,  the  results  of  this  study 
suggest  that  the  time  intervals  over  which  SCIM  presently 
integrates  the  received  "speech"  and  noise  signals  are  too  long. 
The  "speech"  is  integrated  over  1  second  and  the  noise  over  2 
seconds.  For  the  evaluation  of  time-varying  systems,  an  inte¬ 
gration  time  of  approximately  500  milliseconds  for  both  "speech" 
and  noise  would  appear  to  be  more  appropriate.  This  interval 
corresponds  roughly  to  the  duration  of  a  syllable. 
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Since  individual  SCI's  are  difficult  to  interpret,  it  may 
be  desirable  to  have  SCIM  compute  a  large  number  of  SCI's  before 
an  average  score  is  displayed.  The  average  score  could  then  be 
used  to  estimate  the  probability  that  a  syllable  will  be  correct¬ 
ly  identified  over  the  system,  regardless  of  when  the  syllable 
is  transmitted.  It  is  clear  that  such  scores  must  be  eventually 
related  to  how  well  longer  message  units  (e.g.  phrases  and 
sentences)  can  be  understood.  The  ultimate  measure  of  system 
performance  is  not  the  intelligibility  of  a  series  of  independent 
test  words,  but  the  ease  and  reliability  with  which  system  users 
can  transmit  and  receive  operational  messages. 
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2.  EVALUATION  OF  COMMUNICATION  SYSTEMS  EMPLOYING  PEAK  CLIPPING 


2 . 1  Introduction  and  Approach 

In  many  radio-communication  systems,  the  speech  signal  is 

subjected  to  peak  clipping  at  the  transmitter.  Prior  to  the 

modulation  stage,  the  extreme  peaks  of  the  speech  signal  are 

intentionally  removed  so  that  a  given  level  of  transmitter 

power  can  be  more  efficiently  utilized.  Since  peak  clipping 

and  subsequent  restoration  of  the  maximum  amplitude  increases 

20 

the  long-term  RMS  power  of  a  speech  signal,  systems  employing 
even  moderate  levels  of  peak  clipping  tend  to  transmit  more 
intelligible  speech  than  systems  that  do  not  employ  peak  clipping. 
In  the  evaluation  of  such  systems  with  a  device  like  SCIM,  this 
effect  must  be  taken  into  account.  The  computed  SCI's  should 
accurately  predict  the  intelligibility  scores  that  might  be 
obtained  with  the  aid  of  speakers  and  listeners. 

Q 

The  results  of  an  earlier  study  indicated  that  SCIM  was 
unable  to  predict  the  intelligibility  of  peak-clipped  speech. 

In  the  present  study,  this  problem  was  investigated  in  some 
detail  to  determine  whether  SCIM  could  be  modified  to  compute 
more  reliable  SCI's.  Since  peak  clipping  changes  the  amplitude 
distribution  of  a  speech  signal,  it  was  possible  that  the 
difficulty  encountered  with  SCIM  was  inherent  in  the  concept  of 
the  AI,  on  which  the  design  of  SCIM  is  based.  Certain  assump¬ 
tions  made  in  the  calculation  of  an  AI  are  violated  in  the  case 
of  peak-clipped  speech.  If  the  problem  could  be  explained  in 
terms  of  this  argument,  then  a  simple  equipment  modification 
would  not  be  a  solution.  On  the  other  hand,  considering  that 
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SCIM  really  measures  RMS  rather  than  peak  S/N  ratios,  and  that 
only  moderate  levels  of  peak  clipping  are  involved,  a  relatively 
minor  change  in  SCIM  could  lead  to  the  desired  performance. 

Under  certain  operating  conditions  of  a  communication 
system,  the  speech  signal  may  be  subjected  to  unintentional  peak 
clipping.  This  may  happen  because  of  a  faulty  component,  an 
equipment  mismatch,  or  other  factors  which  are  difficult  to 
control  and  correct  in  a  field  situation.  The  clipping  may  not 
be  symmetrical  for  positive  and  negative  amplitudes,  and  the 
level  of  clipping  may  change  as  a  function  of  time.  If  SCIM 
were  to  be  used  to  evaluate  such  a  system,  it  would  be  necessary 
to  provide  a  device  which  could  detect  the  type  and  level  of 
clipping  that  might  be  encountered.  In  view  of  the  difficulties 
associated  with  the  design  of  such  a  device,  it  was  decided  that 
the  present  study  would  not  be  concerned  with  unintentional  peak 
clipping. 

A  survey  of  the  literature  was  made  to  determine  the  intell¬ 
igibility  of  speech  for  different  amounts  of  peak  clipping  and 
for  various  S/N  ratios.  SCIM  measurements  were  then  obtained 
for  these  same  conditions.  Using  a  compilation  of  the  most 
applicable  and  reliable  data  found  in  the  literature  as  a  model, 
the  performance  of  SCIM  was  critically  evaluated.  After  the 
primary  cause  of  the  incorrect  SCI's  was  identified  and  eliminated, 
the  performance  of  SCIM  was  found  to  be  adequate.  A  conversion 
chart  was  prepared  for  relating  SCI's  to  intelligibility  scores 
when  the  level  of  peak  clipping  is  known. 
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2.2  Effect  of  Peak  Clipping  on  Speech  Intelligibility 


A  block  diagram  of  a  typical  radio-communication  system 
employing  peak  clipping  is  shown  in  Fig.  2-l(a).  After  the 
speech  signal  is  subjected  to  peak  clipping,  the  maximum 
amplitude  of  the  signal  is  restored  to  its  original  value.  At 
the  receiving  antenna,  the  transmitted  radio  signal  is  mixed 
with  radio-frequency  noise.  Such  a  system  may  be  simulated  in 
the  laboratory  by  means  of  the  instrumentation  indicated  in  Fig. 
2-l(b).  All  of  the  system  components  shown  in  Fig.  2-l(a)  have 
their  counterparts  in  Fig.  2-l(b),  except  that  the  noise  source 
is  more  clearly  defined  in  the  laboratory  simulation.  Instru¬ 
mentation  such  as  is  shown  in  Fig.  2-1 (b)  has  been  used  exten¬ 
sively  for  the  purpose  of  measuring  the  effects  of  peak  clipping 
on  speech  intelligibility. 

Because  different  studies  have  used  different  speech 
materials,  different  kinds  of  noise,  different  methods  of 
measuring  S/N  ratio,  and  even  different  definitions  of  clipping 
level,  it  was  difficult  to  integrate  the  results  of  the  studies 
reported  in  the  literature.  Consider,  for  example,  the  pre¬ 
sumedly  simple  relation  between  S/N  ratio  and  Harvard  PB-word 
intelligibility'''  for  white  masking  noise.  Here  the  speech 
material  and  the  kind  of  noise  are  clearly  defined,  and  no 
clipping  is  involved.  However,  as  is  evident  from  Fig.  2-2, 
the  results  obtained  in  different  studies  vary  considerably. 

-j  O 

In  the  study  reported  by  Stuckey,  the  listeners  may  have  been 
over-exposed  to  particular  randomizations  of  the  word  lists 
used.  They  were  also  permitted  to  adjust  their  individual 
listening  levels.  Whether  either  of  these  factors  was  respon¬ 
sible  for  the  unusually  high  test  scores  is  not  known. 
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I  SOURCE  OF 
R.  F.  NOISE 


FIG. 2-1  BLOCK  DIAGRAMS  OF  (a)  TYPICAL  RADIO 
COMMUNICATION  SYSTEM  EM PLO Y I  NG  P EA K 
CLI  PPING,  AND  (b)  INSTRUMENTATION  USED 
TO  SIMULATE  AND  EVALUATE  SUCH  A  SYSTEM. 
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RELATION  BETWEEN  S PEEC H-TO- NO  I S E  RATIO  AND 
PB-WORD  INTELLIGIBILITY  FOR  WHITE  MASKING  NOISE. 


5 

The  vocabulary  in  the  study  published  by  Hirsh  was  limited 

to  only  200  words.  Since  the  full  vocabulary  of  the  Harvard 

PB-Word  Test  consists  of  1000  words,  this  might  explain  why 

the  test  scores  were  somewhat  higher  than  those  reported  in 

other  studies.  The  oldest  study  considered,  published  in  a 

classic  paper  by  Miller,'*''*'  produced  scores  that  are  10  percentage 

points  lower  at  some  S/N  ratios  than  the  scores  obtained  by 

14  15  16 

Pollack  and  Pickett  in  three  separate  studies.  ’  ’  The  curve 

drawn  through  the  data  points  in  Pig.  2-2  therefore  only  approx¬ 
imates  the  relation  between  S/N  ratio  and  speech  intelligibility. 

When  peak  clipping  is  introduced  in  a  given  experiment,  the 
prevailing  S/N  ratio  is  most  meaningfully  specified  in  terms  of 
measurements  made  when  the  clipping  level  is  zero.  This  conven¬ 
tion  preserves  the  usual  definition  of  S/N  ratio  as  the  amplitude 
relation  between  the  RMS  speech  level  and  the  RMS  noise  level, 
expressed  in  dB.  The  clipping  level  is  usually  defined  in  terms 
of  the  level  that  is  exceeded  1  percent  of  the  time  in  an  un¬ 
processed  speech  signal  sampled  8  times  per  second.  When  only 
those  few  speech  peaks  with  amplitudes  larger  than  this  level 
are  clipped,  the  clipping  level  is  referred  to  as  0  dB.  Clipping 
levels  greater  than  about  40  dB  reduce  the  speech  signal  to  a 
sequence  of  square  pulses  (infinitely  clipped  speech). 

After  peak  clipping,  the  loss  in  amplitude  of  the  speech 
signal  is  usually  compensated  by  amplification  before  the 
signal  is  mixed  with  noise.  Two  types  of  compensation  are 
commonly  employed;  level  compensation  and  power  compensation. 

In  level  compensation,  the  original  peak  level  of  the  speech 
signal  is  restored.  Hence  the  gain  of  the  compensating  ampli¬ 
fier,  expressed  in  dB,  is  exactly  equal  to  the  clipping  level. 
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In  power  compensation,  tne  original  RMS  power  of  the  speech 

signal  is  restored.  For  power  compensation  the  gain  of  the 

compensating  amplifier  is  always  less  than  the  clipping  level. 

These  two  forms  of  compensation  are,  of  course,  related.  With 

20 

the  aid  of  the  relation  derived  by  Wathen-Dunn  and  Lipke,  it 
is  possible  to  convert  data  obtained  with  one  form  of  compensa¬ 
tion  to  data  that  might  have  been  obtained  with  the  other  form 
of  compensation. 

The  results  of  three  studies  reported  in  the  literature^ ’ ^ 
are  shown  in  Figs.  2-3  and  2-4.  For  a  given  S/N  ratio,  peak 
clipping  and  subsequent  level  compensation  generally  increase 
intelligibility  (Fig.  2-3),  whereas  peak  clipping  and  subsequent 
power  compensation  have  no  appreciable  effect  on  intelligibility 
(Fig.  2-4).  Because  level  compensation  is  typically  employed  in 
radio-communication  systems,  the  data  given  in  Figs.  2-2  and  2-3 
were  combined  and  averaged  to  obtain  a  "psychoacoustic  model'' 
against  which  the  performance  of  SCIM  could  be  compared.  The  data 
constituting  this  model  are  given  in  Fig.  2-5- 


2.3  Performance  of  SCIM 


The  results  obtained  with  SCIM  are  shown  in  Fig.  2-6.  Each 
point  in  this  figure  represents  the  mean  of  3  display  readings. 

In  comparing  these  results  with  the  corresponding  data  from  the 
literature  (Figs.  2-3  and  2-4),  it  is  obvious  that  the  performance 
of  SCIM  was  grossly  inadequate.  This  finding  confirmed  the 
earlier  reports  that  SCIM  appeared  to  be  unable  to  predict  the 
intelligibility  of  peak-clipped  speech.  In  obtaining  the  results 
shown  in  Fig.  2-6,  it  was  noted  that  the  higher-frequency  filters 
in  the  SCIM  analyzer  barely  contributed  to  the  total  SCI  when 
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CLIPPING  LEVEL  IN  dB 


FIG.  2-3  PB-WORD  INTELLIGIBILITY  AS  A  FUNCTION  OF 
CLIPPING  LEVEL  FOR  PEAK  CLIPPING  WITH 
LEVEL  COMPENSATION.  THE  PARAMETER  IS 
SPEECH-TO-NOI  SE  RATIO.  DATA  FROM  POLLACK 
AND  PICKETT  1959  (•—•),  KRYTER  ET  AL.  1947 
(o— o),  AND  EWING  AND  HUDDY  1966  (a — a). 


FIG. 2-4  PB-WORD  INTELLIGIBILITY  AS  A  FUNCTION  OF 
CLI PPI NG  LEVEL  FOR  PEAK  CLIPPING  WITH 
POWER  COMPENSATION.  THE  PARAMETER  IS 
SPEECH -TO- NOISE  RATIO.  DATA  FROM  POLLACK 
AND  PICKETT  1959  (• — •),  KRYTER  ET  AL.  1947 
(o - o)  ,  AND  EWING  AND  HUDDY  1966(a— ■*). 


CLIPPING  LEVEL  IN  dB 


FIG. 2-5  PB-WORD  I  NTELL I  G  I  B  I  L I T  Y  A  S  A  F  U  N  CTION  OF 
CLIPPING  LEVEL  FOR  PEAK  CLIPPING  WITH 
LEVEL  COMPENSATION.  THE  PARAMETER  IS 
SPEECH-TO-NOI SE RATIO.  DATA  ESTIMATED 
AND  AVERAGED  FROM  FIGS.  2-2  AND  2-3. 
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high  clipping  levels  were  employed.  Despite  a  very  favorable 
overall  S/N  ratio,  the  S/N  ratios  measured  by  SCIM  in  some  of 
the  high-frequency  bands  were  consistently  low. 

The  source  of  this  problem  was  traced  to  the  tones  which 
are  normally  transmitted  by  the  SCIM  signal  generator  to  uisable 
Automatic  Gain  Control  (AGC)  circuits  in  the  system  under 
evaluation.  During  the  interval  in  which  the  SCIM  analyzer 
measures  noise  in  the  upper  portion  of  the  spectrum,  a  *100  Hz 
tone  is  received  in  the  lower  portion  of  the  spectrum.  V.'hile 
this  tone  does  not  normally  interfere  with  the  measurements, 
amplitude  clipping  will  introduce  strong  harmonics  of  the  tone 
into  the  upper  portion  of  the  spectrum.  These  harmonics  serve 
to  reduce  the  S/N  ratios  measured  in  the  high-frequency  bands. 


With  the  AGC-tone  oscillators  removed  from  the  SCIM  signal 
generator,  the  performance  of  SCIM  was  measured  again.  The  new 
results  obtained  for  peak  clipping  with  level  compensation  and 
peak  clipping  with  power  compensation  are  shown  in  Figs.  2-7 
and  2-8,  respectively.  Since  these  results  compared  fairly  well 
with  the  published  intelligibility  data  (Figs.  2-3  and  2— *4 ) ,  the 
performance  of  SCIM  was  considered  adequate.  Further  work  con¬ 
sisted  of  relating  a  given  SCI  and  a  given  clipping-  level  to  a 
unique  intelligibility  score. 


2 .  *4  Preparation  of  Conversion  Chart 


When  SCIM  is  used  to  evaluate  a  typical  communication  system 
employing  peak  clipping  and  level  compensation,  the  only  avail¬ 
able  information  consists  of  the  SCI  and  the  clipping  level.  In 
most  situations,  the  overall  S/N  ratio  (defined  for  a  clipping 
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SPEECH  COMMUNICATION  INDEX  (SCI) 


FIG. 2-7  SPEECH  COMMUNICATION  INDEX  AS  A 
FUNCTION  OF  CLIPPING  LEVEL  FOR  PEAK 
CLIPPING  WITH  LEVEL  COM  PEN  S  ATION.  T  HE 
PARAMETER  IS  SIGN AL-TO- NOI  SE  RATIO. 
DATA  OBTAINED  WITHOUT  AGC  TONES. 


SPEECH  COMMUNICATION  INDEX  (SCI) 


FIG. 2-8  SPEECH  COMMUNICATION  INDEX  AS  A 
FUNCTION  OF  CLIPPING  LEVEL  FOR  PEAK 
CLIPPING  WITH  POWER  COM  PEN  SATION.  T  HE 
PARAMETER  IS  S  I  GNAL-TO-NOI  SE  RATIO. 
DATA  OBTAINED  WITHOUT  AGC  TONES. 


level  of  0  dB)  is  not  known.  To  be  able  to  convert  the  available 
information  into  an  intelligibility  score  that  might  be  obtained 
in  a  formal  listening  test,  it  was  necessary  to  derive  the 
relation  between  SCI's  and  Harvard  PB-word  intelligibility  for 
various  clipping  levels. 

Using  the  data  shown  in  Fig.  2-7,  S/N  ratios  were  estimated 
for  21  SCI's  (covering  the  range  from  0.00  to  1.00  in  steps  of 
0.05)  at  each  clipping  level.  These  S/N  ratios  were  then  trans¬ 
lated  into  intelligibility  scores,  using  the  data  shown  in  Fig.  2-5. 
For  example,  the  S/N  ratio  corresponding  to  an  SCI  of  0.40  and  a 
clipping  level  of  12  dB  is  approximately  2.5  dB.  (See  Fig.  2-7.) 

This  S/N  ratio,  at  the  same  clipping  level,  corresponds  to  an 
intelligibility  score  of  about  90  percent.  (See  Fig.  2-5.)  The 
relation  between  SCI's  and  intelligibility  scores  thus  derived 
is  shown  in  Fig.  2-9-  The  data  from  which  this  figure  was  pre¬ 
pared  are  given  in  Appendix  9* 


2 . 5  Conclusions 


The  principal  conclusion  drawn  from  the  results  of  this  study 
was  that  SCIM  is  potentially  capable  of  predicting  the  intelligi¬ 
bility  of  peak-clipped  speech.  The  problem  encountered  earlier 
was  not  due  to  a  fundamental  shortcoming  of  the  AI  concept,  but 
to  a  minor  defect  in  the  design  of  SCIM  which  can  be  easily 
remedied.  In  order  to  retain  the  advantages  of  employing  AGC 
tones  during  the  assessment  of  certain  types  of  communication 
systems,  it  was  necessary  to  modify  both  the  SCIM  signal  gener¬ 
ator  and  the  SCIM  analyzer.  These  modifications  are  described 
in  the  following  section. 
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FIG. 2-9  RELATION  BETWEEN  SPEECH  COMMUNICATION 
INDEX  AND  PB-WORD  I  NTELL I  G I  B  I  L I T Y  FOR 
PEAK  CLIP  PING  WITH  LEVEL  COMPENSATION. 
THE  PARAMETER  IS  CLIPPING  LEVEL. 


3.  EQUIPMENT  MODIFICATIONS 


3 . 1  Introduction 

During  the  course  of  this  study,  SCIM  was  modified  and 
improved  in  several  respects.  Some  of  these  modifications  were 
made  in  direct  compliance  with  the  requirements  stated  in  the 
contract.  These  included  the  changes  necessary  for  the  evalua¬ 
tion  of  systems  employing  peak  clipping,  and  the  reduction  in 
computation  time.  Other  modifications  and  improvements  were 
implemented  because  they  appeared  generally  desirable  and  could 
be  easily  accomplished.  These  included  replacing  some  compon¬ 
ents  by  improved  versions,  and  minor  changes  that  would  facili¬ 
tate  equipment  calibration. 

A  second  addendum  to  the  Operation/Maintenance  Manual 

19 

for  SCIM  was  prepared.  This  addendum  contains  technical  details 
and  schematic  diagrams  for  all  equipment  modifications  that  were 
made.  General  descriptions  of  some  of  the  more  important  modifi¬ 
cations  are  given  below. 


3 . 2  Modifications  for  the  Evaluation  of  Systems  Employing 

Peak  Clipping 

Examination  of  the  measuring  procedure  inherent  to  SCIM 
revealed  that  the  clipped  400  Hz  AGC  tone,  which  is  present  when 
noise  measurements  are  being  made  in  the  high-frequency  bands, 
has  harmonics  of  considerable  amplitude  in  the  frequency  region 
of  these  bands.  The  presence  of  this  tone,  therefore,  seriously 
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interfered  with  the  measurements  made  by  SCIM.  In  view  of  this 
problem,  it  was  decided  to  modify  SCIM  as  follows:  (1)  A  3-KHz 
AGC  tone  would  be  substituted  for  the  400-Hz  AGC  tone  so  that 
the  harmonics  of  the  AGC  tone  would  fall  outside  of  the  last 
band  employed  by  SCIM.  (2)  A  sharp-skirted  3-KHz  notch  filter 
would  be  designed  to  provide  sufficient  attenuation  of  the  AGC 
tone  so  that  the  presence  of  the  tone  does  not  appreciably  affect 
the  performance  of  SCIM.  The  center  frequency  of  the  notch 
filter  would  be  manually  variable  over  a  limited  range  to  accommo¬ 
date  possible  frequency  translation,  as  might  be  encountered 
with  single-sideband  radio  systems. 

The  SCIM  signal  generator  was  modified  to  provide  a  3-KHz 
AGC  tone  during  both  time  intervals  in  which  the  analyzer  per¬ 
forms  noise  measurements.  For  calibration  purposes,  the  3-KHz 
tone  can  also  be  transmitted  independent  of  a  test-signal 
sequence.*  A  3-KHz  notch  filter  was  constructed  and  installed 
into  the  SCIM  analyzer.  During  calibration,  the  transmitted 
3-KHz  tone  is  "tuned  out"  at  the  notch  filter.  A  special  circuit 
and  associated  panel  meter  have  been  installed  in  the  SCIM 
analyzer  to  facilitate  this  procedure. 


3 . 3  Reduction  of  Computation  Time  and  Improved  Synchronization 


A  timing  diagram  of  the  present  test-signal  sequence  and 
analyzer  timing  sequence  is  shown  in  Fig.  3-1*  In  the  previous 


*  By  means  of  two  switches,  the  former  arrangement  of  AGC  tones 
is  still  available. 
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FIG. 3-1  TIMING  SEQUENCE  OF  SIGNAL  GENERATOR  AND  ANALYZER. 


analyzer  timing  sequence,  each  change  in  mode  was  mediated  by 
a  new  tone-pair  synchronization  signal  from  the  generator.  In 
the  present  scheme,  only  one  tone  pair  is  transmitted,  which 
triggers  an  analyzer  timing  sequence  that  is  independent  of  the 
test-signal  sequence.  To  allow  for  slight  inaccuracies  in  the 
synchronization  between  the  test-signal  sequence  and  the  anal¬ 
yzer  timing  sequence,  the  transmission  times  of  the  "speech" 
signal  and  the  AGC  tones  were  increased  from  1.0  to  1.2  seconds, 
while  the  integration  times  remained  fixed  at  1  second.  Thus, 
approximately  200  milliseconds  were  provided  between  integration 
intervals  to  insure  that  the  correct  signals  are  integrated, 
even  if  the  analyzer  timing  sequence  should  be  slightly  shifted 
in  time  relative  to  the  test-signal  sequence. 

The  time  intervals  necessary  for  sampling  the  integrator 
outputs  were  reduced  from  1.0  to  0.2  seconds.  The  measurement 
accuracy  was  found  to  be  unaffected  by  this  change.  This  modi¬ 
fication  reduced  the  computation  time  from  10  to  2  seconds,  and 
the  total  run  time  from  12  to  4.4  seconds. 

A  disadvantage  of  the  old  timing  scheme  was  that  if  any  one 
of  the  tone  pairs  was  missed,  or  if  a  spurious  tone  pair  was 
received,  the  analysis  was  negated  and  the  analyzer  had  to  be 
manually  reset.  This  occurred  quite  often  in  the  evaluation  of 
time-varying  communication  systems.  When  SCIM  was  equipped  with 
a  data  logger  (printer  and  clock),  the  printed  data  were  often 
unreliable.  Another  disadvantage  was  that  the  tone  pairs  were 
integrated  along  with  the  "speech"  and  noise  signals.  Although 
this  error  was  minor,  it  could  not  be  completely  eliminated. 

Since  the  present  synchronization  scheme  requires  only  one 
initial  tone  pair,  the  probability  that  a  successfully  started 
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analysis  run  will  be  disturbed  has  been  reduced  considerably. 
The  amplitude  of  the  tone  pair  was  increased  above  its  former 
value  (this  was  permissable  because  it  is  no  longer  integrated) 
to  reduce  the  chances  of  a  missed  start  command.  In  the  new 
scheme,  the  synchronization  detection  system  is  disabled  until 
an  analysis  run  is  complete,  so  that  a  spurious  tone  pair 
occurring  after  the  reception  of  the  initial  tone  pair  will  not 
be  interpreted  as  another  start  command. 

This  new  scheme  allows  for  the  complete  automation  of 
measurements  if  SCIM  is  used  with  a  data  logger .  If  a  start 
command  is  missed,  the  complete  run  would  be  missed.  If  a 
spurious  tone  pair  occurs,  which  can  only  be  detected  during  a 
missed  run,  the  analyzer  would  reset  itself  and  be  ready  for 
the  next  valid  tone  pair.  It  is  doubtful  that  the  latter  con¬ 
dition  would  occur,  because  if  the  start  command  were  too  weak 
to  be  detected,  a  spurious  tone  pair  within  the  next  four 
seconds  would  probably  also  be  too  weak  to  be  detected. 


3  .  Other  Modifications 

Although  the  dynamic  range  of  the  SCIM  analyzer  was 
adequate  for  most  noise  spectra  that  are  likely  to  be  en¬ 
countered,  a  signal  with  appreciable  high-frequency  energy 
could  overload  the  detectors.  This  potential  problem  was 
eliminated  by  increasing  the  dynamic  range  of  the  detectors, 
and  by  resetting  the  VU  meter  to  allow  better  utilization  of 
the  increased  range.  This  modification  is  illustrated  in 
Pig.  3-2.  The  shaded  areas  in  this  figure  represent  the  filter 
input  ranges  for  linear  detector  output,  before  the  modification. 
The  total  column  lengths  represent  the  input  ranges  of  the 
present  system.  The  improvement  in  linearity  is  shown  for  Filter 
No.  4. 
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FIG. 3-2  ILLUSTRATION  OF  IMPROVEMENT  IN  DYNAMIC  RANGE  OF  FILTERS 
IN  ANALYZER.  DYNAMIC  RANGE  IS  REGION  OVER  WHICH  FILTER 
INPUT  AND  DETECTOR  OUTPUT  ARE  RELATED  LINEARLY. 


Previously,  the  back-to-back  calibration  was  accomplished 
with  the  aid  of  a  random-noise  source.  During  the  calibration 
procedure,  as  much  as  +  0.10  SCI  of  scatter  has  been  encount¬ 
ered  at  low  SCI  settings.  A  new  calibration  module,  consisting 
of  9  oscillators  whose  outputs  are  summed,  has  been  designed 
and  fabricated.  Using  this  module,  the  maximum  scatter  was 
found  to  be  +  0.01  SCI  at  all  SCI  settings.  The  new  module 
thus  permits  a  faster  and  more  accurate  calibration  of  SCIM. 
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Appendix  1 


Word  contents  of  twelve  25-item  half 
lists  of  the  Modified  Rhyme  Test. 


Initial  Consonant  Variable 


Half  List 

No. 

A 

B 

C 

D 

E 

F 

1 

sold 

fold 

gold 

told 

hold 

cold 

2 

dip; 

wig 

pig 

rig 

big 

fig 

3 

book 

took 

look 

cook 

hook 

shook 

14 

gale 

male 

tale 

bale 

sale 

pale 

5 

will 

hill 

kill 

till 

fill 

bill 

6 

fame 

same 

came 

name 

game 

tame 

7 

rang 

fang 

hang 

bang 

sang 

gang 

8 

sip 

rip 

tip 

hip 

dip 

lip 

9 

feat 

meat 

heat 

seat 

beat 

neat 

10 

hot 

got 

not 

pot 

lot 

tot 

11 

dust 

just 

rust 

must 

gust 

bust 

12 

top 

cop 

pop 

hop 

shop 

mop 

Final  Consonant  Variable 

13 

dig 

dip 

dim 

did 

dill 

din 

in 

duck 

dun 

dung 

dub 

dug 

dud 

15 

fizz 

fig 

fin 

fill 

fib 

fit 

16 

came 

cape 

cane 

case 

cave 

cake 

17 

kick 

king 

kid 

kit 

kin 

kill 

18 

late 

lay 

lake 

lace 

lane 

lame 

19 

pale 

pane 

pace 

pay 

page 

pave 

20 

Pig 

pick 

Pip 

pin 

pill 

pit 

21 

pun 

puff 

pup 

puck 

pus 

pub 

22 

rave 

rake 

race 

rate 

raze 

ray 

23 

seep 

seen 

seethe 

seem 

seed 

seek 

24 

sub 

sum 

sud 

sun 

sup 

sung 

25 

teach 

tear 

tease 

teal 

team 

teak 
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Appendix  lt  cont. 


Initial 

Consonant 

Variable 

No. 

Half  List 

G 

H 

I 

J 

K 

L 

1 

bed 

shed 

red 

led 

fed 

wed 

2 

kick 

lick 

sick 

tick 

wick 

pick 

3 

hark 

dark 

mark 

lark 

park 

bark 

4 

peel 

reel 

feel 

heel 

keel 

eel 

5 

foil 

oil 

coil 

boil 

soi  1 

toil 

6 

ten 

pen 

men 

hen 

then 

den 

7 

pin 

sin 

tin 

win 

din 

fin 

8 

sun 

nun 

gun 

fun 

run 

bun 

9 

tent 

bent 

dent 

went 

rent 

sent 

10 

raw 

paw 

law 

jaw 

thaw 

saw 

11 

kit 

bit 

fit 

sit 

wit 

hit 

12 

nest 

vest 

west 

test 

best 

rest 

13 

may 

way 

say 

gay 

day 

pay 

Final 

Consonant 

Variable 

14 

ban 

bath 

back 

bass 

bat 

bad 

15 

bean 

beach 

beat 

beam 

bead 

beak 

16 

bun 

bus 

but 

buff 

buck 

bug 

17 

heath 

heap 

heal 

hear 

heat 

heave 

18 

cut 

cub 

cuff 

cup 

cud 

cuss 

19 

map 

mat 

math 

man 

mass 

mad 

20 

pass 

pack 

pat 

pad 

path 

pan 

21 

peace 

peas 

peak 

peal 

peat 

peach 

22 

sake 

sale 

save 

sane 

safe 

same 

23 

sad 

sap 

sass 

sat 

sap 

sack 

24 

sit 

sing 

sin 

sick 

sip 

sill 

25 

tab 

tan 

tarn 

tang 

tack 

tap 
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Appendix  2 


Half-list  MRT  scores  (corrected  for  chance) 
obtained  from  8  listeners  in  an  experiment 
employing  steady-state  simulated  aircraft 
noise.  Data  from  Williams  et  al.  1967. 


S/N  Ratio 


Half 

List 

+  6dB 

OdB 

-2dB 

-4dB 

-6dB 

Mean 

A 

8^.-4% 

57.4% 

6l.6% 

50.8% 

34.6% 

64.8% 

D 

'  88.6 

55.0 

50.8 

40.6 

37.0 

62.0 

C 

36.2 

70.6 

58.6 

34.6 

26.8 

62.8 

D 

90.4 

71.8 

60.4 

56.2 

43.0 

70.3 

E 

86.2 

73.6 

65.2 

49.6 

41.2 

69.3 

F 

84. 4 

62.8 

53.8 

44.2 

29.8 

62.5 

0 

85.0 

68.8 

54.4 

44.2 

33.4 

64.3 

H 

93.4 

75.4 

64.6 

50.2 

25.0 

68.1 

I 

81.4 

62.2 

67.0 

49.0 

28.6 

64.7 

J 

89.2 

73.6 

53.2 

46.0 

35.2 

66.2 

K 

86.2 

65.8 

56.2 

43.6 

28.0 

63.3 

L 

88.0 

67 . 6 

49.6 

51.4 

33.4 

65.0 

63 


Appendix  3 


Order  in  which  75  Combination  Tapes  were 
administered  to  listeners  during  five  test 
sessions.  Each  Combination  Tape  is  identi¬ 
fied  according  to  the  Speech  Tape  (1-25)  and 
Program  Tape  (I-III)  from  which  it  was  prepared. 


Test  Session 


1 

2 

3 

4 

5 

1-1 

16-III 

2-1 

15-IH 

6-II 

2-II 

17-H 

13-H 

4-1 

25-HI 

3-III 

18-1 

l-III 

24-H 

15-1 

4-II 

20-11 

11-11 

12-1 

9-III 

5-III 

19-1 

18-III 

21-11 

17-1 

6-1 

21-III 

22-1 

ll-III 

12-11 

7-III 

22-11 

7-II 

16-11 

5-1 

9-H 

23-IH 

25-1 

3-1 

14-11 

8-1 

24-1 

10-III 

20-III 

4  - 1 1 1 

10-11 

25-H 

23-H 

l-II 

21-1 

11-1 

8-III 

19-HI 

13-IH 

13-11 

12-III 

14-1 

20-1 

7-1 

22-III 

13-1 

5-II 

3-H 

2-III 

19-11 

14-III 

9-1 

17-IH 

23-1 

10-1 

15-H 

6-III 

16-1 

3-II 

2  4  - 1 1 1 
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NAME 


Appendix  4  Typical  response  form. 

_ DATE _ TEST  NO. 


bust 

just 

dust 

16 

game 

fame 

tame 

rust 

gust 

must 

same 

name 

came 

31 


keel 

reel 

peel 

feel 

eel 

heel 

cake 

cape 

cave 

17 

puck 

pus 

pup 

32 

oil 

toil 

soil 

came 

case 

cane 

pun 

pub 

puff 

coil 

foil 

boil 

fill 

fizz 

fib 

fig 

fit 

fin 

18 


pin 

Pig 

pill 

pit 

pick 

Pip 

4 

dug 

duck 

dub 

19 

pin 

sin 

din 

dung 

dud 

dun 

tin 

fin 

win 

pop 

top 

hop 

20 

man 

map 

mass 

shop 

cop 

mop 

math 

mad 

mat 

dim 

dill 

did 

21 

run 

sun 

bun 

dip 

dig 

din 

nun 

gun 

fun 

fed 

shed 

wed 

22 

rent 

tent 

sent 

red 

bed 

led 

bent 

went 

dent 

tack 

tab 

tan 

tarn 

tap 

tang 

23 


jaw 

thaw 

raw 

paw 

saw 

law 

sick 

sit 

sip 

24 

cuff 

cut 

cup 

sing 

sill 

sin 

cuss 

cud 

cub 

10 


1 1 


12 


13 


14 


15 


sick 

tick 

wick 

25 

kid 

kin 

kit 

kick 

pick 

lick 

king 

kill 

kick 

sap 

sad 

sass 

26 

teal 

teach 

team 

sag 

sack 

sat 

tease 

teak 

tear 

park 

hark 

lark 

■  dark 

bark 

mark 

bale 

gale 

sale 

male 

pale 

tale 

27 


28 


sud 

sung 

sup 

sum 

sub 

sun 

33 


34 


35 


36 


37 


38 


39 


40 


41 


42 


save 

sake 

safe 

sale 

same 

sane 

46 


47 


48 


beat 

beam 

bean 

beak 

bead 

beach 

back 

bath 

ban 

bad 

bat 

bass 

west 

nest 

rest 

best 

test 

vest 

men 

then 

ten 

49 

dig 

did 

hen 

den 

pen 

din 

dim 

peak  peat  peal 
peach  peace  peas 


50 


hold 

sold 

fold 

gold 

cold 

told 

path 

pass 

pad 

pack 

pat 

pan 

teak 

teal 

tear 

team 

teach 

tease 

hang 

sang 

rang 

bang 

gang 

fang 

pane 

pace 

page 

pale 

pave 

Pay 

dip 

sip 

lip 

rip 

tip 

hip 

52 


53 


54 


sum 

sup 

sud 

sub 

sun 

sung 

rig 

dig 

big 

wig 

fig 

Pig 

seep  seed  seen 
seek  seethe  seem 


heat 

feat 

beat 

55 

dark 

bark 

mark 

meat 

neat 

seat 

park 

hark 

lark 

not 

lot 

pot 

56 

peel 

heel 

feel 

hot 

tot 

got 

keel 

reel 

eel 

lace 

late 

lane 

57 

coil 

boil 

toil 

lame 

lay 

lake 

soil 

oil 

foil 

dig 

Pig 

big 

43 

hear 

heap 

heat 

58 

sake 

sane  sale 

rig 

wig 

fig 

heath 

heave 

heal 

safe 

save  same 

fill 

will 

hill 

29 

seed 

seep 

seem 

kill 

bill 

till 

seen 

seek 

seethe 

rate 

rave 

raze 

rake 

ray 

race 

30 


hook 

shook 

cook 

look 

book 

took 

44 


45 


wit 

kit 

bit 

59 

den 

hen 

pen 

fit 

hit 

sit 

then 

men 

ten 

but 

bun 

buff 

60 

peas 

peal 

peak 

bus 

bug 

buck 

peat 

peach 

peace 
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Errors  made  by  10  listeners  at  each  word  location  of  Program  Tape 
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CN 
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CM 
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96.0 

OO 

C\J 
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O 
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VO 

CM 
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CO 
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CM 
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Errors  made  by  10  listeners  at  each  word  location  of  Program  Tape 


WORD  LOCATION 

c 

-=T 

E-' 

CO 

cp 

vo 

-^r 

-=T 

10 

CO 

vo 

-=T 

CM 

i — 1 

rH 

-=r 

on 

vo 

rH 

CM 

CM 

LP 

Cr 

vo 

o 

on 

E — 

t — 
LP 

CP 

on 

-=r 

i — 1 

O 

CM 
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CM 

oo 

o 

LP 
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CP 

CO 

on 

CO 
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CP 

LP 

CP 
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o 
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vo 

CO 

cr 

LP 
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E — 

CP 

vo 
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rH 
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vo 

on 
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VO 

CP 

LP 
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CP 
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on 
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rH 
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LP 
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Errors  made  by  10  listeners  at  each  word  location  of  Program  Tape 


WORD  LOCATION 

60 

on 

in 

10 

in 

0- 

CM 
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i — l 

cd 

CM 

in 
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vo 

CM 

CO 

1 — 1 

rH 

CO 

CO 
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Errors  made  by  10  listeners  at  each  word  location  of  Program  Tape  II . 


WORD  LOCATION 

20 

VO 

on 

in 

on 

rH 

-=r 

CM 

on 

on 

rH 

on 

in 

vo 

o 

CO 

rH 

2.12 

in 

CO 

t-- 

cn 

< — 1 

rH 

on 

i — 1 

on 

- =T 

VO 

CD 

rH 

rH 

rH 

vo 

-3- 

1 — 1 

CM 

-=T 

CM 

oo 

CM 

CO 

CD 

rH 

C"- 

VO 

CM 

CD 

in 

VO 

LO 

on 

CM 

ov 

CM 

VO 

CM 

vo 

r- 

-=r 

vo 

CD 

0- 

CD 

vo 

-=r 

o 

rH 

CM 

vo 

on 

in 

-=T 

CM 

C^- 

in 

on 

i — 1 

-=r 

t — i 

CM 

i — 1 

VO 

l n 

CM 

-=T 

oo 

o 

O 

C- 

rH 

89.9 

VO 

1 — 1 

i — i 

rH 

0- 

vo 

on 

o 

rH 

-=T 

rH 

95.5 

in 

i — i 

CM 

VO 

rH 

vo 

on 

o 

1.25 

95.6 

^r 

i — i 

CM 

i — i 

rH 

CVJ 

on 

on 

r— 1 

i — i 

on 

rH 

rH 

CM 

1.40 

1.98 

CM 

on 

CO 

on 

i — i 

CM 

C^- 

CD 

on 

CO 

Ov 

CO 

CM 

CM 

-=T 

CM 

CD 

CM 

CD 

CO 

cd 

O 

i — 1 

VO 

in 

vo 

CO 

VO 

CM 

CD 

in 

VO 

CD 

CM 

rH 

CD. 

CM 

CM 

i — 1 

r — 1 

rH 

rH 

i — 1 

i — 1 

CM 

i — 1 

0- 

0 .60 

-=r 

-=r 

rH 

92.6 

r — 1 
i — 1 

-=? 

cd 

- 

t>- 

on 

rH 

CM 

rH 

CM 

LTV 

CM 

-=T 

C"- 

CO 

on 

-=r 

on 

-=r 

rH 

3.12 

2.60 

61.6 

O 

i — 1 

rH 

■=r 

i — i 

i — 1 

CO 

CM 

O 

-=T 

OO 

O 

in 

in 

CD 

CD 

rH 

rH 

CM 

in 

vo 

on 

o 

1.08 

vo 

in 

CD 

CO 

-=r 

rH 

CM 

rH 

-=T 

rH 

CM 

in 

o 

vo 

rH 

rH 

93.6 

VO 

CM 

in 

rH 

CvJ 

LTV 

rH 

rH 

-=r 

-=r 

CM 

VO 

CM 

CM 

CO 

CM 

vo 

CD 

vo 

tzt 

CD 

CM 

2.73 

on 

-=r 

vo 

VO 

in 

VO 

CO 

in 

CO 

cd 

O 

rH 

O 

i — 1 

O 

rH 

CD 

o 

rH 

CD 

cd 

rH 

cd 

CO 

CM 

r-- 

CD 

0- 

CO 

vo 

o 

-=r 

CO 

CM 

in 

in 

rH 

LH 

CO 

CM 

rH 

-=T 

CM 

CM 

rH 

-=r 

CM 

rH 

on 

o 

CM 

rH 

1.91 

rH 

in 

CO 

in 

CM 

rH 

in 

rH 

on 

rH 

on 

CM 

rH 

CO 

CM 

rH 

CM 

r — 1 

-=r 

CO 

on 

CM 

on 

rH 

on 

on 

in 

CM 

or 

cr 

VO 

i — i 

CM 

on 

oo 

vo 

in 

CD 

2.60 

on 

vo 

CM 

68.8 

CM 

on 

CJ 

rH 

rH 

rH 

CM 

on 

o 

in 

C"- 

o 

96.0 

rH 

rH 

•^r 

i — 1 

CO 

vo 

in 

o 

1.76 

O 

on 

CD 

SPEECH 

TAPE 

rH 

OJ 

m 

m 

VO 

00 

CD 

O 

- 

CvJ 

m 

ID 

vo 

r^- 

GO 

CD 

o 

CvJ 

CvJ 

CvJ 

CJ 

ro 

CJ 

CJ 

m 

CJ 

CO 

c 

o 

CO 

|  SCORE 

69 


Errors  made  by  10  listeners  at  each  word  location  of  Program  Tape11- 


WORD  LOCATION 

o 

-=r 

00 

OJ 

in 

rH 

o 

^r 

o 

CM 

rH 

rH 

CM 

in 

ON 

O' 

cr 

r- 

rH 

rH 

rH 

rH 

rH 

rH 

vo 

00 

rH 

in 

rH 

in 

on 

0- 

CM 

0- 

r — 1 

-=T 

on 

CM 

on 

ON 

0- 

CO 

CC 

rH 

in 

00 

i — 1 

on 

t*- 

1 — 1 

oo 

on 

rH 

00 

OO 

■=T 

-=T 

rH 

^r 

00 

oo 

CN 

00 

oo 

0- 

Cr 

-=r 

1 — 1 

rH 

-=r 

O 

-=T 

O 

00 

rH 

rH 

93-1 

VO 

on 

CTn 

oo 

VO 

o- 

-^r 

O 

r-H 

in 

in 

o 

r-H 

on 

oo 

in 

OO 

o- 

o 

rH 

on 

CTn 

ON 

1 — 1 

CN 

0- 

VO 

oo 

c- 

vo 

in 

vo 

in 

t"- 

CM 

rH 

00 

00 

in 

on 

C-- 

rH 

00 

i — i 

vo 

rH 

oo 

-=r 

00 

vo 

rH 

i — i 

00 

CO 

rH 

in 

CTN 

rH 

oo 

oo 

CM 

0- 
( — ) 

on 

in 

in 

vo 

-3- 

on 

- 

OO 

00 

VO 

cr 

LP 

-=r 

-=r 

CO 

i — 1 

o 

rH 

on 

vo 

CC 

-=r 

00 

CM 
l n 

on 

o 

CM 

on 

0- 

0~ 

in 

on 

on 

OO 

vo 

OO 

o 

rH 

OS 

OS 

r- 

o 

o 

rH 

oo 

0- 

CTN 

ON 

c^- 

cc 

oo 

0- 

CO 

O 

rH 

oo 

oo 

OO 

oo 

o 

oo 

o- 

o 

rH 

ON 

in 

CM 

on 

i — i 

-=r 

o 

o 

O 

CM 

C 

in 

ON 

CT' 

rH 

on 

t — 1 

00 

rH 

vo 

1 — i 

-=r 

-=r 

o 

vo 

00 

rH 

vo 

ON 

o 

on 

VO 

on 

OO 

O 

rH 

in 

o- 

oo 

rH 

r-H 

i — 1 

in 

on 

00 

in 

00 

-=r 

CO 

CM 

O 

rH 

on 

66 . 0 

ON 

C\J 

00 

on 

■=r 

on 

-=T 

00 

00 

0- 

o 

vo 

on 

rH 

CM 

rH 

ON 

oo 

CM 

rH 

O'. 

o 

^r 

o 

o 

oo 

rH 

o 

in 

CN 

00 

oo 

ON 

00 

oo 

-=r 

o 

1 — 1 

-3- 

on 

oo 

vo 

in 

0- 

ON 

os 

oo 

ON 

CO 

on 

oo 

oo 

0- 

o 

o 

vo 

CM 

rH 

on 

on 

oo 

OJ 

VO 

CM 

1 — 1 

00 

on 

-cr 

CM 

O 

00 

t"- 

O 

o 

o- 

ON 

in 

oo 

rH 

rH 

0- 

VO 

on 

o 

rH 

-=r 

r — l 

in 

in 

ON 

-=r 

oo 

on 

in 

O 

rH 

oo 

o 

r-H 

oo 

OS 

cr 

oo 

O 

rH 

m 

rH 

OS 

cr 

CO 

CM 

in 

-=T 

rH 

0- 

vo 

vo 

i — 1 

^r 

o 

in 

CM 

-=r 

on 

vo 

oo 

on 

on 

00 

VO 

00 

on 

o 

rH 

VO 

0- 

Os 

O' 

vo 

in 

■=r 

rH 

O'. 

ON 

oo 

vo 

on 

CM 

O' 

on 

ON 

rH 

LP 

00 

00 

VO 

o 

1 — 1 

oo 

VO 

OO 

oo 

VO 

OO 

vo 

O 

vo 

O 

r-H 

vo 

OO 

o 

rH 

O 

cr 

I  00 

vo 

in 

OO 

ON 

OO 

on 

t 

c 

o 

oo 

rH 

00 

CO 

LP 

rH 

rH 

00 

i — 1 

00 

rH 

00 

^r 

00 

o 

O 

vo 

o 

O 

0- 

CN 

SPEECH 

TAPE 

rH 

OJ 

ro 

m 

vo 

00 

OS 

O 

— 

00 

ro 

vr 

m 

vo 

h- 

00 

os 

o 

C\J 

OsJ 

OJ 

OJ 

ro 

OJ 

<3- 

OJ 

m 

OJ 

GO 

c 

o 

CO 

|  SCORE 

70 


Appendix  5,  cont. 

Errors  made  by  10  listeners  at  each  word  location  of  Program  Tape  II- 

60  ; 

o 

o 

o 

o 

o 

o 

ioo.  q 

cd 

LD. 

on 

i— 1 

on 

VO 

on 

CM 

CVJ 

i — 1 

rH 

rH 

rH 

0.96 

CD 

^r 

rH 

88.3 

CO 

in 

rH 

rH 

CO 

i — 1 

rH 

C^- 

•=r 

CM 

CD 

O 

2.16 

oo 

CO 

CO 

0- 

in 

i— 1 

C\J 

CM 

rH 

O 

on 

CM 

o 

in 

CO 

CO 

VO 

in 

O 

o 

o 

o 

o 

o 

V - f 

o 

o 

rH 

WORD  LOCATION 

in 

in 

■ — i 

C\J 

on 

in 

rH 

on 

CM 

CM 

1 — 1 

CO 

rH 

rH 

VO 

on 

rH 

oo 

CD 

rH 

vo 

on 

CO 

^r 

in 

CVJ 

in 

in 

CO 

rH 

VO 

i — 1 

rH 

rH 

CD 

-3- 

rH 

on 

CM 

vo 

CD 

i — 1 

OO 

vo 

CM 

m 

vo 

0- 

on 

in 

i — i 

C-- 

in 

on 

CM 

VO 

-=3- 

i — 1 

- 

rH 

-=T 

CD 

2.16 

CD 

CM 

CM 

D- 

CM 

in 

-=r 

in 

o 

rH 

rH 

-=T 

rH 

VO 

i — 1 

CM 

VO 

0- 

CD 

o 

1 — 1 

CO 

on 

in 

CM 

rH 

O 

on 

CM 

-=T 

on 

CM 

CD 

in 

i — i 

in 

-3- 

0- 

ON 

- 

CVJ 

CM 

CO 

in 

oo 

VO 

rH 

in 

vo 

Cr 

O 

rH 

CO 

-=T 

on 

3.39 

on 

CO 

in 

o 

n 

C\J 

-=r 

rH 

■=r 

CM 

CM 

in 

o 

CD 

rH 

i — 1 

on 

CD 

cn 

-=T 

rH 

rH 

CO 

o 

o 

oo 

CM 

O 

o 

CD 

CD 

CO 

in 

( — i 

in 

0- 

-=r 

rH 

VO 

on 

CM 

O 

rH 

rH 

CM 

CM 

-=T 

OO 

•=r 

rH 

m 

•=r 

oo 

CM 

in 

CO 

CM 

CO 

in 

vo 

D- 

C\J 

CO 

O 

O 

o 

•=r 

o 

99.0 

VO 

-=r 

rH 

-=r 

o 

o 

o 

CM 

O 

in 

CD 

CD 

in 

rH 

rH 

on 

o 

CM 

O 

in 

vo 

o 

m 

0- 

CD 

^r 

-=r 

i— i 

m 

C\J 

VO 

(— i 

O 

i — 1 

CM 

-=T 

VO 

CM 

on 

on 

on 

CD 

t*- 

-J 

LP 

rH 

OO 

VC 

on 

o 

CD 

CM 

CO 

in 

in 

on 

-=r 

C\J 

CM 

rH 

-=r 

vo 

on 

o 

in 

CD 

o 

in 

in 

CD 

cm 

-=T 

m 

o 

CM 

o 

o 

o 

rH 

in 

c — 

CD 

i — 1 

-3- 

CD 

00 

CO 

VO 

c*- 

D- 

cd 

CD 

0- 

VO 

VO 

rH 

CD 

rH 

OO 

VO 

OO 

IT 

Is- 

0- 

CO 

VO 

CD 

vo 

in 

vo 

o 

in 

CM 

-=r 

rH 

CM 

IspeechI 

LU 

CL 

£ 

CM 

ro 

M- 

un 

co 

00 

CD 

o 

- 

CM 

ro 

IT) 

CO 

1^- 

00 

CD 

o 

CM 

CM 

CM 

CM 

ro 

CM 

M- 

CM 

m 

CM 

CO 

cz 

o 

CO 

SCORE 

71 


Errors  made  by  10  listeners  at  each  word  location  of  Program  Tape  III. 


WORD  LOCATION 

20 

10 

on 

CD 

in 

vo 

cd 

10 

OO 

ITS 

10 

CD 

CO 

10 

OO 

10 

vo 

CO 

10 

vo 

in 

CO 

CD 

in 

CO 

CO 

OO 

-=T 

2.33  1 

10.9 

CD 

(—1 

10 

VO 

rH 

rH 

vo 

CO 

0- 

-=T 

vo 

10 

in 

rH 

1 — 1 

on 

•=T 

2.68 

^r 

CO 

on 

CD 

0- 

vo 

CO 

i — 1 

cd 

■=T 

on 

CO 

rH 

on 

on 

CO 

C-- 

on 

on 

-H" 

-=r 

CD 

on 

rH 

vo 

rH 

in 

on 

CO 

CO 

on 

•=r 

O 

-=r 

vo 

t>- 

co 

vo 

rH 

in 

C'- 
1 — 1 

rH 

rH 

rH 

-3- 

l n 

1 — 1 

CO 

in 

o 

1.26 

93.6 

16 

on 

10 

in 

VO 

o 

i — i 

CO 

0- 

CO 

10 

vo 

vo 

CD 

10 

CO 

CO 

vo 

■=r 

vo 

CO 

CO 

on 

vo 

on 

0— 

rH 

in 

i — i 

on 

CO 

rH 

rH 

rH 

0.32 

0.75 

O 

VO 

CD. 

-=r 

r — I 

rH 

rH 

CO 

0.16 

C-- 

-=T 

O 

o 

co 

CD 

13 

i — i 

-=r 

t — 1 

rH 

on 

rH 

rH 

OO 

-3- 

o 

1.00 

CO 

^r 

O'. 

12 

CO 

CO 

i — i 

CO 

rH 

rH 

CO 

CO 

on 

CD 

1.00 

1.91 

CO 

0~- 

OO 

11 

C\J 

rH 

rH 

rH 

0.20 

o 

in 

o 

in 

0- 

CD 

o 

rH 

on 

CD 

CVJ 

CVJ 

-=r 

vo 

o 

,1.91 

o 

CO 

CD 

rH 

rH 

-=r 

CO 

■=r 

CO 

o 

2.17 

89.7 

CO 

- 

rH 

on 

CO 

on 

o 

CD 

CD 

O 

96.0 

(M 

rH 

r — 1 

CO 

rH 

CO 

vo 

on 

o 

O 

o 

vo 

m 

CD 

VO 

rH 

rH 

rH 

on 

CO 

in 

o 

1.50 

in 

on 

CD. 

LT\ 

rH 

-=r 

o 

o 

o 

CO 

o 

99.5 

-=T 

VO 

rH 

CO 

CO 

o 

1.20 

in 

vo 

CD 

on 

on 

r — 1 

CO 

■=r 

CO 

o 

0.72 

o 

D — 
CD . 

CO 

r— 1 

rH 

CO 

rH 

rH 

OO 

CO 

o 

CO 

o 

1.78 

on 

o 

CD 

i — 1 

CD 

rH 

u r 

1  vo 

i — i 

O J 

on 

1-52 

2.62 

CD 

* 

rH 

CO 

SPEECH 

TAPE 

pH 

C\J 

m 

VO 

GO 

CD 

O 

— 

CO 

ro 

*3“ 

m 

CO 

N- 

OO 

CD 

20 

OO 

OO 

OO 

23 

24 

25 

CO 

c 

:e 

SO 

SCORE 

72 


Errors  made  by  10  listeners  at  each  word  location  of  Program  Tapem. 


o 

-=T 

on 

r— 1 

CM 

-=T 

rH 

rH 

CM 

vo 

CM 

CO 

oo 

o 

-=r 

P 

rH 

•=T 

OM 

oo 

C 

Cr 

-=r 

i — i 

CM 

rH 

rH 

1 — 1 

o 

-=r 

o 

rH 

CM 

o 

rH 

P 

OM 

CO 

0^ 

rH 

-=r 

o 

o 

O 

CM 

O 

P 

OM 

OM 

t>- 

Cn 

LP 

c — 

i — i 

CM 

P 

O 

1.69 

P 

on 

OM 

VO 

pn 

i — 1 

rH 

-=r 

CM 

CM 

on 

o 

O 

OM 

o 

o 

VO 

OM 

P 

on 

OM 

CM 

O 

r — 1 

C— 

o 

• — i 

CM 

OO 

O 

r — 1 

VO 

on 

rH 

P 

on 

CM 

CO 

VO 

P 

vo 

0- 

oo 

vo 

-=T 

CM 

P 

on 

o 

^r 

-=r 

WORD  LOCATION 

-=T 

on 

CM 

i — 1 

CM 

rH 

^r 

OJ 

O 

o 

vo 

o 

o 

C-- 

CM 

on 

on 

CM 

c 

f — 1 

O 

i — 1 

i — 1 

VO 

rH 

OM 

rH 

VO 

-=X 

rH 

-=r 

o 

CM 

f- 

on 

on 

P 

P 

(V 

on 

LP 

OM 

on 

i — 1 

om 

IP 

OM 

rH 

-=r 

rH 

VO 

CM 

vo 

OO 

VO 

O- 

OO 

on 

vo 

VO 

OM 

-=r 

rH 

OM 

CM 

VO 

O 

-=X 

f — i 

on 

on 

o 

■ — i 

CM 

oo 

p 

OM 

r— 1 

i>- 

P 

OM 

VO 

CO 

OO 

on 

P 

0- 

OM 

vo 

p 

rH 

-=r 

oo 

CM 

P 

P 

CO 

CM 

OO 

VO 

on 

o 

on 

i — i 

< — i 

CM 

on 

vo 

o 

oo 

o 

vo 

rH 

CM 

CM 

o 

OM 

cr\ 

C\J 

Or 

rH 

rH 

o 

CM 

o 

P 

VO 

O 

p 

0- 

CM 

oo 

CVI 

on 

rH 

on 

rH 

CM 

on 

o 

P 

OO 

o 

o 

vo 

OM 

0- 

CVI 

CM 

i — 1 

P 

rH 

P 

on 

rH 

CM 

rH 

CM 

rH 

CM 

CM 

on 

rH 

OM 

CO 

rH 

CM 

-=T 

oo 

VO 

CM 

i — i 

CT 

-=r 

CM 

VO 

CM 

P 

ZT 

vo 

rH 

rH 

rH 

VO 

P 

P 

OO 

O 

CM 

rH 

on 

CM 

CM 

P 

0- 

P 

CM 

c^- 

i — 1 

O' 

O 

i—l 

CM 

00 

OO 

VO 

VO 

•=r 

O 

rH 

OO 

-=r 

O' 

CO 

vo 

VO 

CM 

rH 

vo 

VO 

C^- 

-=r 

on 

on 

on 

oo 

CM 

^=r 

-^r 

CM 

OO 

rH 

rH 

rH 

-=r 

-=r 

o 

rH 

VO 

rH 

p 

-=X 

OM 

O' 

CM 

1 — 1 

r — t 

CT 

>  r— 

1  oc 

1  -=J 

rH 

CO 

VO 

O 

rH 

P 

CT 

vo 

vo 

0- 

CM 

on 

P 

on 

oo 

• 

vo 

vo 

CM 

CM 

O' 

) 

CM 

J  CM 

J 

Or 

)  CM 

rH 

t— 

i 

VO 

p 

o 

o 

o 

rH 

rH 

on 

OM 

i — 

CM 

1  o- 
1 

-  c 

'  C 

i — 

)  C 

1  r- 

) 

i 

c 

r~ 

>  ir 

>  r- 

i 

rH 

1  o- 

-  O' 

\  OO 

rH 

o 
1 — 1 

-=r 

OC 

>  VC 

>  VC 

)  oc 

>  OC 

3  O' 

CM 

1  LT 

'  O' 

vo 

o 

p 

VO 

O' 

O' 

)  oo 

•  oo 

CM 

IspffchI 

>  LO 

1  CL 

► 

•—t 

CVI 

ro 

p 

vo 

CD 

<X> 

o 

- 

CMJ 

ro 

p 

CO 

0- 

00 

CT> 

o 

CMJ 

CVI 

CVJ 

CVI 

ro 

CVI 

oo 

p 

CVJ 

CO 

a 

o 

CO 

SCORE 

73 


Errors  made  by  10  listeners  at  each  word  location  of  Program  Tape  m. 


WORD  LOCATION 

60 

rH 

rH 

00 

0 

0 

0 

-=r 

0 

O 

av 

OV 

CT 

LT 

0 

0 

0 

0 

0 

0 

TT 

O 

O 

rH 

OO 

IT 

or 

rH 

-=r 

r^- 

CM 

CM 

on 

on 

0 

0 

rH 

1.78 

rH 

OO 

OO 

0- 
l n 
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Appendix  6 


Calculation  of  Articulation  Index  for  Word 
Location  52  of  Propram  Tape  I.  (See  also  Fig.  1-6.) 


Band 

No. 

Speech 

Level 

Noise 

Level 

Difference 

Weight 

Contribution 
to  AI 

1 

70 

49 

21 

0.0003 

0.0063 

2 

71 

54 

17 

0.0007 

0.0119 

3 

72 

56 

16 

0.0010 

0.0160 

4 

74 

51 

23 

0.0016 

0.0368 

5 

72 

58 

14 

0.0017 

0.0238 

6 

70 

59 

11 

0.0017 

0.0187 

7 

68 

53 

15 

0.0027 

0.0405 

8 

67 

57 

10 

0.0030 

0.0300 

9 

65 

62 

3 

0.0033 

0.0099 

10 

63 

61 

2 

0.0037 

0.0074 

11-14 

- 

- 

0 

- 

0.0000 

15 

54 

46 

8 

0.0017 

0.0136 

AI  -*• 

0.2149 
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Appendix  7 


Tabulation  of  Articulation  Indexes  for  60 
word  locations  of  each  of  3  Program  Tapes. 


Word 

Location 

Prc 

I 

Dgram  Ta] 
II 

~>e 

III 

Word 

Location 

Pr< 

I 

Dgram  Tap 
II 

>e 

III 

1 

0.928 

0.735 

0.207 

31 

0.337 

0.723 

0.002 

2 

0.767 

O 

CT\ 

\jn 

a\ 

0.239 

32 

0.059 

0.642 

0.052 

3 

0.839 

0.131 

0.498 

33 

0.121 

0.000 

0.498 

4 

0.107 

0.211 

0.630 

34 

0.235 

0.081 

0.453 

5 

0.813 

0.377 

0.761 

35 

0.337 

0.184 

0.138 

6 

0.000 

0.004 

0.498 

36 

0.742 

CO 

on 

0 

0 

0.747 

7 

0.368 

0.083 

0.54  4 

37 

0.027 

0.838 

0.777 

8 

0.205 

0.717 

O 

CO 

m 

0 

38 

0.000 

0.231 

0.740 

9 

0.649 

0.495 

0.458 

39 

0.744 

0.108 

0.620 

10 

0.147 

0.783 

0.569 

40 

0.144 

0.309 

0.617 

11 

0.321 

0.052 

0.477 

41 

0.002 

0.012 

0.170 

12 

0.000 

0.618 

0.281 

42 

0.137 

0.749 

0.720 

13 

0.444 

0.021 

0.436 

43 

0.203 

0.300 

0.753 

14 

0.593 

0.533 

0.412 

44 

0.647 

0.051 

0.181 

15 

0.258 

0.703 

0.593 

45 

0.548 

0.690 

0.223 

16 

0.031 

0.760 

0.242 

46 

0.199 

0.712 

0.664 

17 

0.494 

0.285 

0.492 

47 

0.746 

0.674 

0.144 

18 

0.434 

0.020 

0.054 

48 

0.029 

0.162 

0.048 

19 

0.221 

0.275 

0.072 

49 

0.143 

0.788 

0.056 

20 

0.506 

0.205 

0.025 

50 

0.751 

0.661 

0.113 

21 

0.357 

0.394 

0.024 

51 

0.110 

0.066 

0.281 

22 

0.171 

0.000 

0.718 

52 

0.215 

0.160 

0.715 

23 

0.028 

0.000 

0.017 

53 

0.195 

0.106 

0.715 

24 

0.000 

0.143 

0.785 

54 

0.221 

0.137 

O.63O 

25 

0.542 

0.735 

0.045 

55 

0.006 

0.188 

0.008 

26 

0.434 

0.674 

0.051 

56 

0.378 

0.727 

0.000 

27 

0.059 

0.049 

0.169 

57 

0.000 

0.536 

0.004 

28 

0.089 

0.567 

0.212 

58 

0.753 

0.241 

0.187 

29 

0.067 

0.735 

0.532 

59 

0.003 

0.257 

0.756 

30 

0.534 

0.254 

0.558 

60 

0.162 

0.801 

0.826 
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Appendix  8 


A.  Word  locations  where  predicted  test  scores  were  too  low , 
tabulated  according  to  probable  cause  of  error.  Underlined 
word  locations  are  illustrated  in  Fig.  1-11. 

Cause  1:  Speech  level  read  too  low 

I- 12,  1-16,  1-38,  1-41,  1-57,  1-59, 

II- 6  ,  11-18,  11-22  , 

III- 57. 

Cause  2:  Noise  level  read  too  high 

II- 39, 

III- 28. 

Cause  1  and  Cause  2  combined 

1-6, 

II- ll,  11-23,  11-44, 

III- 19,  III-23,  III-26,  III-31,  HI-41. 

B.  Word  locations  where  predicted  test  scores  were  too  high , 
tabulated  according  to  probable  cause  of  error.  Underlined 
word  locations  are  illustrated  in  Fig.  1-12. 

Cause  1:  Speech  level  read  too  high 

I- 11,  1-23,  1-33,  1-40,  1-52, 

II- 24,  11-27, 

III- 20,  III-35. 

Cause  2:  Noise  level  read  too  low 

1-60, 

11-36, 

III-33,  III-44,  111-47. 

Cause  1  and  Cause  2  combined 

I- 8,  1-49,  1-54, 

II- 14,  11-30,  11-52 , 

III- 16,  III-48. 
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Appendix  9 


Harvard  PB-Word  scores  (in  percent)  corresponding!: 
to  various  Speech  Communication  Indexes  for  peak 
clipping  with  level  compensation. 


Clipping  Level 


SCI 

0  dB 

6  dB 

12  dB 

18  dB 

24  dB 

1.00 

100 

100 

100 

100 

100 

0.95 

99 

99 

99 

99 

99 

0.90 

98 

98 

99 

99 

99 

0.85 

97 

98 

98 

99 

99 

0.80 

96 

97 

98 

98 

99 

0.75 

95 

96 

98 

98 

99 

0.70 

94 

96 

97 

98 

98 

0.65 

93 

95 

97 

98 

98 

0.60 

92 

95 

96 

97 

97 

0.55 

91 

94 

96 

96 

96 

0.50 

90 

92 

95 

95 

95 

0.4  5 

88 

90 

94 

93 

93 

0.40 

86 

86 

90 

91 

91 

0.35 

83 

83 

86 

87 

88 

O 

on 

o 

78 

77 

82 

81 

83 

0.25 

66 

67 

73 

75 

77 

0.20 

50 

56 

63 

66 

69 

0.15 

35 

44 

51 

55 

59 

0.10 

18 

28 

39 

- 

- 

0.05 

2 

- 

- 

- 

- 

0.00 

- 

- 

- 

- 

- 
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